Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Second Derivatives and Laplace Operator01:22

Second Derivatives and Laplace Operator

2.4K
The first order operators using the del operator include the gradient, divergence and curl. Certain combinations of first order operators on a scalar or vector function yield second order expressions. Second-order expressions play a very important role in mathematics and physics. Some second order expressions include the divergence and curl of a gradient function, the divergence and curl of a curl function, and the gradient of a divergence function.
Consider a scalar function. The curl of its...
2.4K
Application of Linearization and Approximation01:29

Application of Linearization and Approximation

193
A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...
193
Graphing Antiderivatives01:30

Graphing Antiderivatives

181
The concept of an antiderivative is fundamental in calculus, describing how a function's values accumulate over time. This process is closely related to physical motion, such as the movement of a rolling ball. As the ball progresses, its position changes in response to variations in velocity, just as an antiderivative graph reflects the cumulative effect of the original function's values.Graphing an antiderivative requires interpreting how a function's values influence the shape of its...
181
Linearization and Approximation01:26

Linearization and Approximation

233
Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...
233
Definition of Laplace Transform01:22

Definition of Laplace Transform

5.1K
The Laplace transform is an indispensable mathematical technique for simplifying the resolution of differential equations by converting them into more manageable algebraic expressions. The Laplace transform of a function is denoted by L[x(t)], where x(t) is the time-domain function. The laplace transform is mathematically expressed as
5.1K
Graphs of Functions01:30

Graphs of Functions

550
Graphs of functions provide a visual representation of how output values change in response to varying inputs. Each point on the graph corresponds to an ordered pair, where the x-coordinate (independent variable) determines the horizontal position and the y-coordinate (dependent variable) determines the vertical position. Linear functions like y = x give a straight line, indicating a constant rate of change.Nonlinear functions display more complex behaviors. Even power functions generate...
550

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Local Surrogate Models With Residual Fuzzy Rules for Model-Agnostic Explanations.

IEEE transactions on cybernetics·2026
Same author

A Prediction Model Integrating Adaptive-Network-Based Fuzzy Inference System and Fuzzy C-Mean Clustering.

IEEE transactions on cybernetics·2026
Same author

Individual Linguistic Granular Computing: A Granulation-Degranulation-Based Approach.

IEEE transactions on cybernetics·2026
Same author

S<sup>2</sup>FS: Spatially-Aware Separability-Driven Feature Selection in Fuzzy Decision Systems.

IEEE transactions on neural networks and learning systems·2026
Same author

Data-Driven Cation Engineering Guides Electrolyte Design for Sustainable Aqueous Zinc Battery Chemistries.

Advanced materials (Deerfield Beach, Fla.)·2026
Same author

A complex-valued widening spiking neural network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

A New Human-Likeness and Comfort Index for Robot Movements Along Prescribed Paths.

IEEE transactions on cybernetics·2026
Same journal

Robust Semiglobal and Global Stabilization for Nonlinear Normal Form Systems by Time-Varying Feedback.

IEEE transactions on cybernetics·2026
Same journal

Adaptive Global Asymptotic Output Stabilization of Uncertain Nonlinear Systems Under Dynamic State/Input Quantization.

IEEE transactions on cybernetics·2026
Same journal

Accelerated Distributed Gradient Tracking for Constrained Aggregative Optimization Over Time-Varying Digraphs.

IEEE transactions on cybernetics·2026
Same journal

Small-Gain-Based Plug-and-Play Distributed Control Framework for DC Microgrids With Decentralized Reconfiguration.

IEEE transactions on cybernetics·2026
Same journal

Prescribed-Time Impulsive Control of High-Order Integrator Systems.

IEEE transactions on cybernetics·2026
See all related articles

Related Experiment Videos

A clustering-based graph Laplacian framework for value function approximation in reinforcement learning.

Xin Xu, Zhenhua Huang, Daniel Graves

    IEEE Transactions on Cybernetics
    |May 8, 2014
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a novel clustering-based graph Laplacian framework for reinforcement learning (RL) feature representation and value function approximation (VFA). The new method efficiently generates basis functions, improving control performance in continuous state spaces with fewer samples.

    Related Experiment Videos

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Control Theory

    Background:

    • Sequential decision problems with large or continuous state spaces are challenging in reinforcement learning (RL).
    • Feature representation and value function approximation (VFA) are critical research areas for addressing these challenges.
    • Existing methods may require extensive data or computational resources.

    Purpose of the Study:

    • To present a clustering-based graph Laplacian framework for feature representation and VFA in RL.
    • To enable efficient handling of continuous state spaces in Markov decision processes (MDPs).
    • To improve the performance and sample efficiency of RL algorithms.

    Main Methods:

    • Constructing a graph Laplacian using clustering techniques (K-means, Fuzzy C-means) via subsampling in continuous state MDPs.
    • Generating basis functions for VFA through spectral analysis of the graph Laplacian.
    • Integrating the framework with representation policy iteration (RPI) algorithms.

    Main Results:

    • The proposed approach automatically generates basis functions for VFA.
    • Fewer sample points are needed to compute efficient basis functions compared to previous RPI methods.
    • Improved learning control performance was observed across various parameter settings.

    Conclusions:

    • The clustering-based graph Laplacian framework offers an efficient solution for feature representation and VFA in RL with continuous state spaces.
    • This method enhances sample efficiency and control performance.
    • The framework provides a robust approach for tackling complex sequential decision problems.