Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

436
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
436
Reinforcement01:23

Reinforcement

791
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
791
Observational Learning01:12

Observational Learning

795
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
795
Steps in the Modeling Process01:14

Steps in the Modeling Process

601
Albert Bandura's theory of observational learning identifies four critical processes: attention, retention, motor reproduction, and reinforcement or motivation.
Attention is the first necessary component for observational learning. It involves focusing on what the model is doing and saying. For example, if you decide to take a drawing class to enhance your skills, you need to pay close attention to the instructor's words and hand movements. The characteristics of the model significantly...
601
Comparison between RL and RC circuits01:24

Comparison between RL and RC circuits

5.9K
An RC circuit consists of resistance and capacitance, while in an RL circuit, capacitance is replaced by an inductor. RL and RC circuits are first-order differential circuits that store energy. An RC circuit stores energy in the electric field, while an RL circuit stores energy in the magnetic field. When connected to a battery, an RC circuit charges the capacitor, causing the current to decrease from maximum to zero upon being fully charged. This increases the voltage across the capacitor from...
5.9K
Stereotype Content Model02:16

Stereotype Content Model

15.3K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
15.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Sparse identification of nonlinear dynamics and Koopman operators with Shallow Recurrent Decoder Networks.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

T-SHRED: symbolic regression for regularization and model discovery with transformer shallow recurrent decoders.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

ECG-Based Prediction of Shock-Refractory Ventricular Fibrillation During Resuscitation Without Interrupting CPR.

Circulation. Arrhythmia and electrophysiology·2026
Same author

Reduced order modeling with shallow recurrent decoder networks.

Nature communications·2025
Same author

Arousal as a universal embedding for spatiotemporal brain dynamics.

Nature·2025
Same author

Lagrangian gradient regression for the detection of coherent structures from sparse trajectory data.

Royal Society open science·2024
Same journal

Interplay between oxygen redox and interfacial stability of Li-rich positive electrodes in sulfide-based all-solid-state batteries.

Nature communications·2026
Same journal

Breaking dependence on melanisation imparts diversity to a dogmatic invasion strategy of phytopathogenic fungi.

Nature communications·2026
Same journal

Hydroxyl-rich nanocavities on perovskite enable nearly barrierless intramolecular hydrogen transfer for nitrate electroreduction to ammonia.

Nature communications·2026
Same journal

Household mobility responses to weather extremes in Kyrgyzstan.

Nature communications·2026
Same journal

Autonomous Motion Vision with Tri-bulk-heterojunctioned Organic Adaptation Transistor.

Nature communications·2026
Same journal

Tissue-adhesive hydrogel optical fiber for peripheral optogenetic neuromodulation.

Nature communications·2026
See all related articles

Related Experiment Video

Updated: Jan 10, 2026

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.1K

SINDy-RL for interpretable and efficient model-based reinforcement learning.

Nicholas Zolman1,2, Christian Lagemann3, Urban Fasel4

  • 1Department of Mechanical Engineering, University of Washington, Seattle, WA, USA. nzolman@uw.edu.

Nature Communications
|November 28, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces SINDy-RL, a new framework combining sparse dictionary learning and deep reinforcement learning (DRL). SINDy-RL creates efficient, interpretable control policies using significantly fewer training examples than traditional DRL.

More Related Videos

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K
The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

9.0K

Related Experiment Videos

Last Updated: Jan 10, 2026

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.1K
Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K
The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

9.0K

Area of Science:

  • Control Theory
  • Machine Learning
  • Fluid Dynamics

Background:

  • Deep reinforcement learning (DRL) excels at complex control but demands extensive data and yields black-box policies.
  • Sparse dictionary learning methods like SINDy offer efficient, interpretable models, particularly in low-data scenarios.

Purpose of the Study:

  • To introduce SINDy-RL, a unified framework integrating SINDy and DRL.
  • To develop efficient, interpretable, and trustworthy data-driven models for dynamics, rewards, and control policies.
  • To address the data inefficiency and interpretability limitations of conventional DRL.

Main Methods:

  • Integration of sparse identification of nonlinear dynamics (SINDy) with deep reinforcement learning (DRL).
  • Development of a unifying framework (SINDy-RL) for learning dynamics, reward functions, and control policies.
  • Application to benchmark control tasks and flow control problems, including gust mitigation on an airfoil.

Main Results:

  • SINDy-RL achieves performance comparable to state-of-the-art DRL algorithms.
  • The framework requires significantly fewer environmental interactions for training compared to traditional DRL.
  • The resulting control policy is orders of magnitude smaller and more interpretable than DRL-derived policies.

Conclusions:

  • SINDy-RL offers a more data-efficient and interpretable alternative to standard DRL for control tasks.
  • The framework provides trustworthy and computationally efficient models suitable for various applications, including embedded systems.
  • This approach enhances the practical applicability of reinforcement learning in complex dynamic environments.