Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Videos

Meta-learning in reinforcement learning.

Nicolas Schweighofer¹, Kenji Doya

¹CREST, Japan Science and Technology Corporation, ATR, Human Information Science Laboratories, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, 619-0288, Kyoto, Japan. nicolas@atr.co.jp

Neural Networks : the Official Journal of the International Neural Network Society

|February 11, 2003

Summary

This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A composite measure of cerebral small vessel disease predicts cognitive change after stroke.

medRxiv : the preprint server for health sciences·2026

Same author

CALM-VLM: CALIBRATION AND SELECTIVE PREDICTION IN VISION-LANGUAGE MODELS FOR RELIABLE BRAIN MRI CLASSIFICATION.

bioRxiv : the preprint server for biology·2026

Same author

Self-Organizing Recruitment of Compensatory Cortical Areas Post-Stroke Can Maximize Residual Motor Performance.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2026

Same author

Associations between contralesional neuroplasticity and motor impairment through deep learning-derived MRI regional brain age in chronic stroke (ENIGMA): a multicohort, retrospective, observational study.

The Lancet. Digital health·2026

Same author

A Streamlined 4-item Wolf Motor Function Test for Efficient Assessment of Upper Extremity Motor Function in Chronic Stroke Survivors.

Neurorehabilitation and neural repair·2026

Same author

Towards AI-based precision rehabilitation via contextual model-based reinforcement learning.

Journal of neuroengineering and rehabilitation·2025

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

This study introduces a biologically plausible meta-reinforcement learning algorithm to adaptively tune reinforcement learning meta-parameters. The algorithm successfully optimizes parameters in dynamic environments, suggesting dopamine neuron firing encodes meta-learning signals.

Area of Science:

Computational neuroscience
Machine learning
Behavioral science

Background:

Reinforcement learning (RL) meta-parameters require tuning to match environmental dynamics and agent performance.
Adaptive tuning is crucial for optimizing RL agents in complex, changing environments.

Purpose of the Study:

To propose and evaluate a biologically plausible meta-reinforcement learning algorithm for adaptive meta-parameter tuning.
To investigate the algorithm's robustness in both simulated and real-world control tasks.
To explore the role of dopamine neuron firing in encoding meta-learning signals.

Main Methods:

Development of a novel meta-reinforcement learning algorithm inspired by biological plausibility.
Testing the algorithm in a simulated Markov decision task.

Related Experiment Videos

Validation of the algorithm in a non-linear control task.

Main Results:

The algorithm robustly identified appropriate meta-parameter values across different environments.
The algorithm effectively controlled the meta-parameter time course in both static and dynamic settings.
Demonstrated successful adaptation to environmental changes and task demands.

Conclusions:

The proposed meta-reinforcement learning algorithm offers a robust and adaptive approach to tuning RL parameters.
The findings suggest that dopamine neuron activity, specifically phasic and tonic firing, may serve as a neural substrate for meta-learning in reinforcement learning.
This work bridges computational approaches to reinforcement learning with neurobiological mechanisms.