Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Steps in the Modeling Process

Steps in the Modeling Process

Albert Bandura's theory of observational learning identifies four critical processes: attention, retention, motor reproduction, and reinforcement or motivation.
Attention is the first necessary component for observational learning. It involves focusing on what the model is doing and saying. For example, if you decide to take a drawing class to enhance your skills, you need to pay close attention to the instructor's words and hand movements. The characteristics of the model significantly...

Comparison between RL and RC circuits

Comparison between RL and RC circuits

An RC circuit consists of resistance and capacitance, while in an RL circuit, capacitance is replaced by an inductor. RL and RC circuits are first-order differential circuits that store energy. An RC circuit stores energy in the electric field, while an RL circuit stores energy in the magnetic field. When connected to a battery, an RC circuit charges the capacitor, causing the current to decrease from maximum to zero upon being fully charged. This increases the voltage across the capacitor from...

Stereotype Content Model

Stereotype Content Model

The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Sparse identification of nonlinear dynamics and Koopman operators with Shallow Recurrent Decoder Networks.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same author

T-SHRED: symbolic regression for regularization and model discovery with transformer shallow recurrent decoders.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026

Same author

ECG-Based Prediction of Shock-Refractory Ventricular Fibrillation During Resuscitation Without Interrupting CPR.

Circulation. Arrhythmia and electrophysiology·2026

Same author

Reduced order modeling with shallow recurrent decoder networks.

Nature communications·2025

Same author

Arousal as a universal embedding for spatiotemporal brain dynamics.

Nature·2025

Same author

Lagrangian gradient regression for the detection of coherent structures from sparse trajectory data.

Royal Society open science·2024

Same journal

Interplay between oxygen redox and interfacial stability of Li-rich positive electrodes in sulfide-based all-solid-state batteries.

Nature communications·2026

Same journal

Breaking dependence on melanisation imparts diversity to a dogmatic invasion strategy of phytopathogenic fungi.

Nature communications·2026

Same journal

Hydroxyl-rich nanocavities on perovskite enable nearly barrierless intramolecular hydrogen transfer for nitrate electroreduction to ammonia.

Nature communications·2026

Same journal

Household mobility responses to weather extremes in Kyrgyzstan.

Nature communications·2026

Same journal

Autonomous Motion Vision with Tri-bulk-heterojunctioned Organic Adaptation Transistor.

Nature communications·2026

Same journal

Tissue-adhesive hydrogel optical fiber for peripheral optogenetic neuromodulation.

Nature communications·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 10, 2026

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

SINDy-RL for interpretable and efficient model-based reinforcement learning.

Nicholas Zolman^1,2, Christian Lagemann³, Urban Fasel⁴

¹Department of Mechanical Engineering, University of Washington, Seattle, WA, USA. nzolman@uw.edu.

Nature Communications

|November 28, 2025

Summary

This summary is machine-generated.

This study introduces SINDy-RL, a new framework combining sparse dictionary learning and deep reinforcement learning (DRL). SINDy-RL creates efficient, interpretable control policies using significantly fewer training examples than traditional DRL.

More Related Videos

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Related Experiment Videos

Last Updated: Jan 10, 2026

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Area of Science:

Control Theory
Machine Learning
Fluid Dynamics

Background:

Deep reinforcement learning (DRL) excels at complex control but demands extensive data and yields black-box policies.
Sparse dictionary learning methods like SINDy offer efficient, interpretable models, particularly in low-data scenarios.

Purpose of the Study:

To introduce SINDy-RL, a unified framework integrating SINDy and DRL.
To develop efficient, interpretable, and trustworthy data-driven models for dynamics, rewards, and control policies.
To address the data inefficiency and interpretability limitations of conventional DRL.

Main Methods:

Integration of sparse identification of nonlinear dynamics (SINDy) with deep reinforcement learning (DRL).
Development of a unifying framework (SINDy-RL) for learning dynamics, reward functions, and control policies.
Application to benchmark control tasks and flow control problems, including gust mitigation on an airfoil.

Main Results:

SINDy-RL achieves performance comparable to state-of-the-art DRL algorithms.
The framework requires significantly fewer environmental interactions for training compared to traditional DRL.
The resulting control policy is orders of magnitude smaller and more interpretable than DRL-derived policies.

Conclusions:

SINDy-RL offers a more data-efficient and interpretable alternative to standard DRL for control tasks.
The framework provides trustworthy and computationally efficient models suitable for various applications, including embedded systems.
This approach enhances the practical applicability of reinforcement learning in complex dynamic environments.