Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Avoidance Learning and Learned Helplessness

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...

Purposive Learning

Purposive Learning

E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Antioxidant Nanozymes: From Rational Design to Biomedical Applications.

Research (Washington, D.C.)·2026

Same author

High-precision freeform DLP bioprinting by thermo-reversible gelation.

Biomaterials·2026

Same author

FAM26F is a novel regulator of depression-related behaviors by modulating hippocampal glutamatergic neuron activity.

Progress in neuro-psychopharmacology & biological psychiatry·2026

Same author

AMPK/ULK1 enhances mitophagy and steroidogenesis in duck Granulosa cells under heat exposure.

Poultry science·2026

Same author

Electroacupuncture (EA) promotes angiogenesis and ameliorates dysregulated autophagy in ischemic stroke mice by modulating the ELAVL1/SIRT1/FOXO1 pathway.

Metabolic brain disease·2026

Same author

Bioinspired Nanoplatform Potentiates Sonodynamic Immunotherapy by Remodeling the Antioxidant Tumor Microenvironment and Activating STING pathway.

Theranostics·2026

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 16, 2025

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

Learning generalizable agents via self-supervised exploration.

Baoxian Liang¹, Lihong Xu¹, Zhichao Deng¹

¹College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China.

Neural Networks : the Official Journal of the International Neural Network Society

|July 6, 2025

Summary

This summary is machine-generated.

This study introduces a new self-supervised exploration framework to improve visual reinforcement learning generalization. The method enhances sample efficiency and agent adaptability in unseen environments.

Keywords:

Self-supervised learning Task-relevant representations Visual reinforcement learning

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Sep 16, 2025

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Artificial Intelligence
Machine Learning
Robotics

Background:

Generalization is a major hurdle in visual reinforcement learning (RL), where agents trained on limited views often fail in new environments.
Directly applying self-supervised learning (SSL) to visual RL can decrease sample efficiency and training stability, hindering generalization.
Existing methods struggle to effectively integrate representation learning with RL decision-making for improved generalization.

Purpose of the Study:

To propose a novel self-supervised exploration framework for learning dynamics-relevant representations in visual reinforcement learning.
To enhance the integration of representation learning into the RL decision-making process for better generalization.
To improve sample efficiency and adaptability of RL agents in unseen environments.

Main Methods:

The proposed framework comprises two modules: a visual discrepancy inference module (VDIM) and an exploration via distributional discrepancy module (EDDM).
VDIM learns shared features across different views to retain task-relevant information and filter out irrelevant data.
EDDM actively explores the environment to identify changed features, improving agent awareness of critical visual information for decision-making.

Main Results:

The framework significantly outperforms prior methods in generalization capabilities.
Demonstrated salient improvements in sample efficiency compared to existing approaches.
The method enables agents to adapt more quickly to new scenarios by enhancing self-awareness of visual features.

Conclusions:

The novel self-supervised exploration framework effectively addresses the generalization challenge in visual reinforcement learning.
Integrating dynamics-relevant representations through VDIM and EDDM leads to superior performance and adaptability.
This approach offers a promising direction for developing more robust and efficient visual RL agents.