Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning because...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length, the...

State Function, Exact and Inexact Differentials

State Function, Exact and Inexact Differentials

A state function is a thermodynamic property that depends solely on the current state of a system, irrespective of its history or how it arrived at that state. These functions are represented by capital letters, such as U, H, and S, which stand for internal energy, enthalpy, and entropy, respectively.For instance, the value of internal energy depends on the system's state variables and remains unaffected by the process path. This means that whether the system underwent a linear process or a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Human-AI Cooperation in Healthcare and Rehabilitation.

Delaware journal of public health·2026

Same author

Music for epilepsy: impact of temporal music structure on intervention outcomes.

Epilepsy & behavior : E&B·2026

Same author

Training sparse convolutional deep predictive coding networks with attention.

Neural networks : the official journal of the International Neural Network Society·2026

Same author

A generative spike prediction model using behavioral reinforcement for re-establishing neural functional connectivity.

Nature computational science·2026

Same author

Transfer Learning in EEG-based Reinforcement Learning Brain Machine Interfaces via Q-learning Kernel Temporal Differences.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same author

A Window Analysis for the Decoding of Premovement and Movement Intentions in Freewill EEG<sup></sup>.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

Analysis of End-Tidal CO2 Variability During Plateau Waves Episodes: An Information Theoretic Approach<sup></sup>.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

AI and Tomosynthesis for Breast Cancer Molecular Subtyping: A step toward precision medicine<sup></sup>.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

Towards Sustainable Protein Recovery from Biological Waste: Assessing Polyethersulfone-based Microfiltration.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

Analysis of the cardiovascular response to standardized polymicrobial peritonitis experimental model.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

Automated Wrist Ultrasound Image Bone Enhancement and Segmentation Using Deep Learning.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

A Deep Learning approach for Depressive Symptoms assessment in Parkinson's disease patients using facial videos.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

See all related articles

Search research articles

Related Experiment Videos

Reinforcement learning via kernel temporal difference.

Jihye Bae¹, Pratik Chhatbar, Joseph T Francis

¹Department of Electrical and Computer Engineering, PO Box 116130 NEB 486, Bldg #33, University of Florida, Gainesville, FL 32611, USA. jihyebae@ufl.edu

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

|January 19, 2012

Summary

This summary is machine-generated.

This study introduces kernel Temporal Difference (TD)(λ) for reinforcement learning. Kernel TD(0) demonstrated faster convergence and superior performance in decoding motor states compared to traditional neural networks.

Related Experiment Videos

Area of Science:

Computational Neuroscience
Machine Learning
Reinforcement Learning

Background:

Estimating state-action value functions is crucial for reinforcement learning.
Adaptive filters offer a promising approach for function approximation in dynamic environments.

Purpose of the Study:

To introduce and evaluate kernel Temporal Difference (TD)(λ), a novel kernel adaptive filter, for reinforcement learning.
To assess the performance of kernel TD(0) in motor state decoding tasks.

Main Methods:

Implementation of a kernel adaptive filter using stochastic gradient on temporal differences.
Focus on the specific case of kernel TD(0).
Comparison with a time-delay neural network (TDNN) trained via backpropagation of temporal difference error.

Main Results:

Kernel TD(0) proved applicable for learning motor state decoding in a center-out reaching task.
Kernel TD(0) achieved faster convergence compared to the TDNN.
Kernel TD(0) provided a better solution than the TDNN.

Conclusions:

Kernel TD(0) is an effective method for state-action value function estimation in reinforcement learning.
Kernel TD(0) offers advantages in convergence speed and solution quality over TDNNs for motor control tasks.