Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Second Derivatives and Laplace Operator

Second Derivatives and Laplace Operator

The first order operators using the del operator include the gradient, divergence and curl. Certain combinations of first order operators on a scalar or vector function yield second order expressions. Second-order expressions play a very important role in mathematics and physics. Some second order expressions include the divergence and curl of a gradient function, the divergence and curl of a curl function, and the gradient of a divergence function.
Consider a scalar function. The curl of its...

Application of Linearization and Approximation

Application of Linearization and Approximation

A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...

Graphing Antiderivatives

Graphing Antiderivatives

The concept of an antiderivative is fundamental in calculus, describing how a function's values accumulate over time. This process is closely related to physical motion, such as the movement of a rolling ball. As the ball progresses, its position changes in response to variations in velocity, just as an antiderivative graph reflects the cumulative effect of the original function's values.Graphing an antiderivative requires interpreting how a function's values influence the shape of its...

Linearization and Approximation

Linearization and Approximation

Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...

Definition of Laplace Transform

Definition of Laplace Transform

The Laplace transform is an indispensable mathematical technique for simplifying the resolution of differential equations by converting them into more manageable algebraic expressions. The Laplace transform of a function is denoted by L[x(t)], where x(t) is the time-domain function. The laplace transform is mathematically expressed as

Graphs of Functions

Graphs of Functions

Graphs of functions provide a visual representation of how output values change in response to varying inputs. Each point on the graph corresponds to an ordered pair, where the x-coordinate (independent variable) determines the horizontal position and the y-coordinate (dependent variable) determines the vertical position. Linear functions like y = x give a straight line, indicating a constant rate of change.Nonlinear functions display more complex behaviors. Even power functions generate...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Local Surrogate Models With Residual Fuzzy Rules for Model-Agnostic Explanations.

IEEE transactions on cybernetics·2026

Same author

A Prediction Model Integrating Adaptive-Network-Based Fuzzy Inference System and Fuzzy C-Mean Clustering.

IEEE transactions on cybernetics·2026

Same author

Individual Linguistic Granular Computing: A Granulation-Degranulation-Based Approach.

IEEE transactions on cybernetics·2026

Same author

S<sup>2</sup>FS: Spatially-Aware Separability-Driven Feature Selection in Fuzzy Decision Systems.

IEEE transactions on neural networks and learning systems·2026

Same author

Data-Driven Cation Engineering Guides Electrolyte Design for Sustainable Aqueous Zinc Battery Chemistries.

Advanced materials (Deerfield Beach, Fla.)·2026

Same author

A complex-valued widening spiking neural network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

A New Human-Likeness and Comfort Index for Robot Movements Along Prescribed Paths.

IEEE transactions on cybernetics·2026

Same journal

Robust Semiglobal and Global Stabilization for Nonlinear Normal Form Systems by Time-Varying Feedback.

IEEE transactions on cybernetics·2026

Same journal

Adaptive Global Asymptotic Output Stabilization of Uncertain Nonlinear Systems Under Dynamic State/Input Quantization.

IEEE transactions on cybernetics·2026

Same journal

Accelerated Distributed Gradient Tracking for Constrained Aggregative Optimization Over Time-Varying Digraphs.

IEEE transactions on cybernetics·2026

Same journal

Small-Gain-Based Plug-and-Play Distributed Control Framework for DC Microgrids With Decentralized Reconfiguration.

IEEE transactions on cybernetics·2026

Same journal

Prescribed-Time Impulsive Control of High-Order Integrator Systems.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Videos

A clustering-based graph Laplacian framework for value function approximation in reinforcement learning.

Xin Xu, Zhenhua Huang, Daniel Graves

IEEE Transactions on Cybernetics

|May 8, 2014

Summary

This summary is machine-generated.

This study introduces a novel clustering-based graph Laplacian framework for reinforcement learning (RL) feature representation and value function approximation (VFA). The new method efficiently generates basis functions, improving control performance in continuous state spaces with fewer samples.

Related Experiment Videos

Area of Science:

Artificial Intelligence
Machine Learning
Control Theory

Background:

Sequential decision problems with large or continuous state spaces are challenging in reinforcement learning (RL).
Feature representation and value function approximation (VFA) are critical research areas for addressing these challenges.
Existing methods may require extensive data or computational resources.

Purpose of the Study:

To present a clustering-based graph Laplacian framework for feature representation and VFA in RL.
To enable efficient handling of continuous state spaces in Markov decision processes (MDPs).
To improve the performance and sample efficiency of RL algorithms.

Main Methods:

Constructing a graph Laplacian using clustering techniques (K-means, Fuzzy C-means) via subsampling in continuous state MDPs.
Generating basis functions for VFA through spectral analysis of the graph Laplacian.
Integrating the framework with representation policy iteration (RPI) algorithms.

Main Results:

The proposed approach automatically generates basis functions for VFA.
Fewer sample points are needed to compute efficient basis functions compared to previous RPI methods.
Improved learning control performance was observed across various parameter settings.

Conclusions:

The clustering-based graph Laplacian framework offers an efficient solution for feature representation and VFA in RL with continuous state spaces.
This method enhances sample efficiency and control performance.
The framework provides a robust approach for tackling complex sequential decision problems.