Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Behavior Modification

Behavior Modification

Behavioral approaches have often been criticized for ignoring mental processes and focusing solely on observable behavior. However, these approaches provide an optimistic perspective for individuals seeking to change their behaviors. Rather than concentrating on intrinsic personality traits, behavioral approaches suggest that even longstanding habits can be modified by changing the reward contingencies that maintain them.
A real-world application of operant conditioning principles is applied...

Cognitive Learning

Cognitive Learning

Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Photoinduced Alkene Carbothiolation via Fe/Ni Dual Catalysis.

Organic letters·2026

Same author

Differential Causal Associations of Gut Microbiota, Blood Metabolites, and Immune Cell Phenotypes With Early- and Late-Onset Alzheimer's Disease: A Bidirectional Mendelian Randomization Analysis.

Cureus·2026

Same author

Single-Cell Transcriptomic Profiling Reveals Molecular Alterations in the Airway Epithelial Cells of Children with Wheezing Infected by Respiratory Syncytial Virus.

Viral immunology·2026

Same author

From the Brain Cell Atlas to Precision Neurology: A review of the application of AI-driven multi-omics in brain science.

GigaScience·2026

Same author

Dual-Network Nanocellulose Hydrogels with Dynamic Schiff Base and Borate Bonds for Rapid Self-Healing and Antioxidant Wound Healing.

ACS applied materials & interfaces·2026

Same author

[Bone marrow infiltration of large B-cell lymphoma with clinical manifestations similar to systemic lupus erythematosus: A case report].

Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences·2026

Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026

Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026

Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026

Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026

Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026

Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 9, 2025

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

Offline reinforcement learning combining generalized advantage estimation and modality decomposition interaction.

Kaixin Jin^1,2, Lifang Wang^3,4, Xiwen Wang⁵

¹Shanxi Province Key Laboratory of Biomedical Imaging and Imaging Big Data, Taiyuan, 030051, China.

Scientific Reports

|May 4, 2025

Summary

This summary is machine-generated.

This study introduces CGM, a novel approach for offline reinforcement learning that improves trajectory stitching and multimodal interaction. CGM enhances action prediction by effectively integrating generalized advantage estimation and modality decomposition interaction, achieving superior performance on benchmark datasets.

Keywords:

Convformer Generalized advantage estimation Modality decomposition interaction Offline reinforcement learning Transformer

More Related Videos

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Pavlovian Conditioned Approach Training in Rats

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

Related Experiment Videos

Last Updated: May 9, 2025

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Pavlovian Conditioned Approach Training in Rats

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

Area of Science:

Artificial Intelligence
Machine Learning
Robotics

Background:

Transformers show promise in offline reinforcement learning (RL) for action prediction through trajectory modeling.
Existing Transformer methods struggle with effective trajectory stitching and capturing deep multimodal interactions.

Purpose of the Study:

To propose CGM, an offline RL approach enhancing trajectory stitching and multimodal interaction for improved action prediction.
To address limitations in existing Transformer-based RL methods.

Main Methods:

CGM combines Generalized Advantage Estimation (GAE) for dataset relabeling and improved trajectory stitching.
Modality Decomposition Interaction (MDI) employs an encoder (ConvFormer for intra-modal, dual-Transformer for inter-modal) and a decoder.
The encoder captures intra-modal associations and facilitates deep cross-modal information exchange between states and actions.

Main Results:

CGM demonstrated superior performance compared to state-of-the-art baseline methods on the D4RL dataset.
On the MuJoCo dataset, CGM outperformed the optimal comparison method by 2.89%.

Conclusions:

CGM effectively enhances trajectory stitching and deep multimodal interactions in offline RL.
The proposed approach shows significant improvements in action prediction accuracy and overall performance.