Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

658
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
658
Observational Learning01:12

Observational Learning

231
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
231
Associative Learning01:27

Associative Learning

462
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
462
Introduction to Learning01:18

Introduction to Learning

479
Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...
479
Woodward–Hoffmann Selection Rules and Microscopic Reversibility01:34

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

3.2K
Electrocyclic reactions, cycloadditions, and sigmatropic rearrangements are concerted pericyclic reactions that proceed via a cyclic transition state. These reactions are stereospecific and regioselective. The stereochemistry of the products depends on the symmetry characteristics of the interacting orbitals and the reaction conditions. Accordingly, pericyclic reactions are classified as either symmetry-allowed or symmetry-forbidden. Woodward and Hoffmann presented the selection criteria for...
3.2K
Types of Genetic Transfer Between Organisms02:18

Types of Genetic Transfer Between Organisms

5.4K
5.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Radiocontrast agent and intraductal pressure promote the progression of post-ERCP pancreatitis by regulating inflammatory response, cellular apoptosis, and tight junction integrity.

Pancreatology : official journal of the International Association of Pancreatology (IAP) ... [et al.]·2021
Same author

Occurrence and fate of polycyclic aromatic hydrocarbons from electronic waste dismantling activities: A critical review from environmental pollution to human health.

Journal of hazardous materials·2021
Same author

ATG16L2 overexpression is associated with a good prognosis in colorectal cancer.

Journal of gastrointestinal oncology·2021
Same author

A Novel Pyroptosis-Related Gene Signature for Prognostic Prediction of Head and Neck Squamous Cell Carcinoma.

International journal of general medicine·2021
Same author

Application of dermal regenerative template in reconstructing skin defects after plantar malignant melanoma excision.

Journal of B.U.ON. : official journal of the Balkan Union of Oncology·2021
Same author

Multi-alleles predict primary non-response to infliximab therapy in Crohn's disease.

Gastroenterology report·2021
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jul 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

632

Distributional generative adversarial imitation learning with reproducing kernel generalization.

Yirui Zhou1, Mengxiao Lu1, Xiaowei Liu1

  • 1Department of Mathematics, College of Sciences, Shanghai University, Shanghai, 200444, China.

Neural Networks : the Official Journal of the International Neural Network Society
|June 5, 2023
PubMed
Summary
This summary is machine-generated.

Generative adversarial imitation learning (GAIL) is improved by integrating distributional reinforcement learning (RL). The new greedy distributional soft gradient (GDSG) algorithm enhances policy generalization and stability for better expert mimicry.

Keywords:
Computational propertiesDistributional reinforcement learningGenerative adversarial imitation learningPolicy generalization

More Related Videos

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning
12:06

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning

Published on: March 3, 2023

4.1K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

348

Related Experiment Videos

Last Updated: Jul 27, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

632
Analyzing Mitochondrial Morphology Through Simulation Supervised Learning
12:06

Analyzing Mitochondrial Morphology Through Simulation Supervised Learning

Published on: March 3, 2023

4.1K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

348

Area of Science:

  • Machine Learning
  • Artificial Intelligence
  • Robotics

Background:

  • Generative adversarial imitation learning (GAIL) frames imitation learning (IL) as matching expert and learned policy state-action distributions.
  • Generalization and computational properties of policy classes are critical for GAIL's effectiveness.
  • Instability in GAIL, particularly with off-policy training, is often caused by Q-value overestimation.

Purpose of the Study:

  • To enhance the generalization capabilities of Generative adversarial imitation learning (GAIL).
  • To introduce distributional reinforcement learning (RL) into GAIL for improved stability and performance.
  • To propose a novel algorithm, greedy distributional soft gradient (GDSG), for solving GAIL.

Main Methods:

  • Proving generalization guarantees in GAIL for controlled policy classes.
  • Integrating distributional RL with GAIL to address Q-value overestimation.
  • Developing the greedy distributional soft gradient (GDSG) algorithm incorporating maximum entropy objectives.

Main Results:

  • Demonstrated that policy generalization can be guaranteed in GAIL under controlled conditions.
  • Showcased that distributional RL alleviates Q-value overestimation, enhancing GAIL's stability.
  • Verified through experiments in MuJoCo environments that GDSG outperforms previous GAIL variants in mimicking expert demonstrations.

Conclusions:

  • The proposed GDSG algorithm effectively improves imitation learning by leveraging distributional RL and maximum entropy objectives.
  • GDSG offers enhanced performance, sample efficiency, and stability compared to existing GAIL methods.
  • The study confirms the benefits of controlled policy classes and distributional RL for robust imitation learning.