Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Introduction to Learning

Introduction to Learning

Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

Electrocyclic reactions, cycloadditions, and sigmatropic rearrangements are concerted pericyclic reactions that proceed via a cyclic transition state. These reactions are stereospecific and regioselective. The stereochemistry of the products depends on the symmetry characteristics of the interacting orbitals and the reaction conditions. Accordingly, pericyclic reactions are classified as either symmetry-allowed or symmetry-forbidden. Woodward and Hoffmann presented the selection criteria for...

Types of Genetic Transfer Between Organisms

Types of Genetic Transfer Between Organisms

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Radiocontrast agent and intraductal pressure promote the progression of post-ERCP pancreatitis by regulating inflammatory response, cellular apoptosis, and tight junction integrity.

Pancreatology : official journal of the International Association of Pancreatology (IAP) ... [et al.]·2021

Same author

Occurrence and fate of polycyclic aromatic hydrocarbons from electronic waste dismantling activities: A critical review from environmental pollution to human health.

Journal of hazardous materials·2021

Same author

ATG16L2 overexpression is associated with a good prognosis in colorectal cancer.

Journal of gastrointestinal oncology·2021

Same author

A Novel Pyroptosis-Related Gene Signature for Prognostic Prediction of Head and Neck Squamous Cell Carcinoma.

International journal of general medicine·2021

Same author

Application of dermal regenerative template in reconstructing skin defects after plantar malignant melanoma excision.

Journal of B.U.ON. : official journal of the Balkan Union of Oncology·2021

Same author

Multi-alleles predict primary non-response to infliximab therapy in Crohn's disease.

Gastroenterology report·2021

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Distributional generative adversarial imitation learning with reproducing kernel generalization.

Yirui Zhou¹, Mengxiao Lu¹, Xiaowei Liu¹

¹Department of Mathematics, College of Sciences, Shanghai University, Shanghai, 200444, China.

Neural Networks : the Official Journal of the International Neural Network Society

|June 5, 2023

Summary

This summary is machine-generated.

Generative adversarial imitation learning (GAIL) is improved by integrating distributional reinforcement learning (RL). The new greedy distributional soft gradient (GDSG) algorithm enhances policy generalization and stability for better expert mimicry.

Keywords:

Computational properties Distributional reinforcement learning Generative adversarial imitation learning Policy generalization

More Related Videos

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning

Published on: March 3, 2023

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Jul 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning

Published on: March 3, 2023

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Machine Learning
Artificial Intelligence
Robotics

Background:

Generative adversarial imitation learning (GAIL) frames imitation learning (IL) as matching expert and learned policy state-action distributions.
Generalization and computational properties of policy classes are critical for GAIL's effectiveness.
Instability in GAIL, particularly with off-policy training, is often caused by Q-value overestimation.

Purpose of the Study:

To enhance the generalization capabilities of Generative adversarial imitation learning (GAIL).
To introduce distributional reinforcement learning (RL) into GAIL for improved stability and performance.
To propose a novel algorithm, greedy distributional soft gradient (GDSG), for solving GAIL.

Main Methods:

Proving generalization guarantees in GAIL for controlled policy classes.
Integrating distributional RL with GAIL to address Q-value overestimation.
Developing the greedy distributional soft gradient (GDSG) algorithm incorporating maximum entropy objectives.

Main Results:

Demonstrated that policy generalization can be guaranteed in GAIL under controlled conditions.
Showcased that distributional RL alleviates Q-value overestimation, enhancing GAIL's stability.
Verified through experiments in MuJoCo environments that GDSG outperforms previous GAIL variants in mimicking expert demonstrations.

Conclusions:

The proposed GDSG algorithm effectively improves imitation learning by leveraging distributional RL and maximum entropy objectives.
GDSG offers enhanced performance, sample efficiency, and stability compared to existing GAIL methods.
The study confirms the benefits of controlled policy classes and distributional RL for robust imitation learning.