Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Comparison between RL and RC circuits

Comparison between RL and RC circuits

An RC circuit consists of resistance and capacitance, while in an RL circuit, capacitance is replaced by an inductor. RL and RC circuits are first-order differential circuits that store energy. An RC circuit stores energy in the electric field, while an RL circuit stores energy in the magnetic field. When connected to a battery, an RC circuit charges the capacitor, causing the current to decrease from maximum to zero upon being fully charged. This increases the voltage across the capacitor from...

Operant Conditioning

Operant Conditioning

Operant conditioning, a key concept in behavioral psychology, involves using reinforcement and punishment to alter the likelihood of a behavior being repeated. B.F. introduced this type of conditioning. Skinner focused on voluntary behaviors and the consequences that follow them, influencing whether these behaviors will be strengthened or diminished.
Reinforcement in operant conditioning can be positive or negative, both of which serve to increase the likelihood of a behavior. Positive...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Leveraging VQ-VAE tokenization for autoregressive modeling of medical time series.

Artificial intelligence in medicine·2024

Same author

Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques.

Drug safety·2023

Same author

Interpretable disease prediction using heterogeneous patient records with self-attentive fusion encoder.

Journal of the American Medical Informatics Association : JAMIA·2021

Same author

Efficient spread-size approximation of opinion spreading in general social networks.

Physical review. E·2019

Same author

Hierarchical ordering with partial pairwise hierarchical relationships on the macaque brain data sets.

PloS one·2017

Same author

Rumor Detection over Varying Time Windows.

PloS one·2017

Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026

Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026

Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026

Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026

Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026

Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 29, 2025

Pavlovian Conditioned Approach Training in Rats

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

Action-driven contrastive representation for reinforcement learning.

Minbeom Kim¹, Kyeongha Rho², Yong-Duk Kim²

¹Graduate School of Artificial Intelligence, Seoul National University, Seoul, Republic of Korea.

|March 18, 2022

Summary

This summary is machine-generated.

This study introduces Action-Driven Auxiliary Task (ADAT), a novel method for reinforcement learning. ADAT improves sample-efficiency and generalization in control tasks by learning essential features from images.

More Related Videos

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Related Experiment Videos

Last Updated: Sep 29, 2025

Pavlovian Conditioned Approach Training in Rats

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Area of Science:

Artificial Intelligence
Machine Learning
Computer Vision

Background:

Reinforcement learning (RL) from high-dimensional images faces challenges in sample-efficiency and generalization.
Existing representation learning methods struggle with environmental diversity and task-specific feature extraction.

Purpose of the Study:

To propose a novel contrastive representation method, Action-Driven Auxiliary Task (ADAT).
To enhance feature learning for improved sample-efficiency and generalization in RL control tasks.

Main Methods:

ADAT utilizes an augmented state-action dictionary to learn representations.
The method forces representations to focus on action-relevant features while ignoring irrelevant details.
Maximizes agreement between observations sharing the same actions.

Main Results:

ADAT significantly outperforms existing model-free and model-based RL algorithms.
Superior performance demonstrated on Atari and OpenAI ProcGen benchmarks.
The method shows marked improvements in sample-efficiency and generalization capabilities.

Conclusions:

ADAT offers a more robust and effective approach to representation learning in RL.
The proposed method addresses key limitations of prior works in diverse and complex environments.
ADAT advances the state-of-the-art in sample-efficient and generalizable reinforcement learning.