Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

334
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
334
Associative Learning01:27

Associative Learning

236
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
236
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

85
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
85
Reinforcement Schedules01:24

Reinforcement Schedules

108
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
108
Behavior Modification01:21

Behavior Modification

94
Behavioral approaches have often been criticized for ignoring mental processes and focusing solely on observable behavior. However, these approaches provide an optimistic perspective for individuals seeking to change their behaviors. Rather than concentrating on intrinsic personality traits, behavioral approaches suggest that even longstanding habits can be modified by changing the reward contingencies that maintain them.
A real-world application of operant conditioning principles is applied...
94
Cognitive Learning01:21

Cognitive Learning

93
Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
93

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Photoinduced Alkene Carbothiolation via Fe/Ni Dual Catalysis.

Organic letters·2026
Same author

Differential Causal Associations of Gut Microbiota, Blood Metabolites, and Immune Cell Phenotypes With Early- and Late-Onset Alzheimer's Disease: A Bidirectional Mendelian Randomization Analysis.

Cureus·2026
Same author

Single-Cell Transcriptomic Profiling Reveals Molecular Alterations in the Airway Epithelial Cells of Children with Wheezing Infected by Respiratory Syncytial Virus.

Viral immunology·2026
Same author

From the Brain Cell Atlas to Precision Neurology: A review of the application of AI-driven multi-omics in brain science.

GigaScience·2026
Same author

Dual-Network Nanocellulose Hydrogels with Dynamic Schiff Base and Borate Bonds for Rapid Self-Healing and Antioxidant Wound Healing.

ACS applied materials & interfaces·2026
Same author

[Bone marrow infiltration of large B-cell lymphoma with clinical manifestations similar to systemic lupus erythematosus: A case report].

Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences·2026
Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026
Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026
Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026
Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026
Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026
Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: May 9, 2025

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.4K

Offline reinforcement learning combining generalized advantage estimation and modality decomposition interaction.

Kaixin Jin1,2, Lifang Wang3,4, Xiwen Wang5

  • 1Shanxi Province Key Laboratory of Biomedical Imaging and Imaging Big Data, Taiyuan, 030051, China.

Scientific Reports
|May 4, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces CGM, a novel approach for offline reinforcement learning that improves trajectory stitching and multimodal interaction. CGM enhances action prediction by effectively integrating generalized advantage estimation and modality decomposition interaction, achieving superior performance on benchmark datasets.

Keywords:
ConvformerGeneralized advantage estimationModality decomposition interactionOffline reinforcement learningTransformer

More Related Videos

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

19.9K
Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

10.9K

Related Experiment Videos

Last Updated: May 9, 2025

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.4K
Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

19.9K
Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

10.9K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Robotics

Background:

  • Transformers show promise in offline reinforcement learning (RL) for action prediction through trajectory modeling.
  • Existing Transformer methods struggle with effective trajectory stitching and capturing deep multimodal interactions.

Purpose of the Study:

  • To propose CGM, an offline RL approach enhancing trajectory stitching and multimodal interaction for improved action prediction.
  • To address limitations in existing Transformer-based RL methods.

Main Methods:

  • CGM combines Generalized Advantage Estimation (GAE) for dataset relabeling and improved trajectory stitching.
  • Modality Decomposition Interaction (MDI) employs an encoder (ConvFormer for intra-modal, dual-Transformer for inter-modal) and a decoder.
  • The encoder captures intra-modal associations and facilitates deep cross-modal information exchange between states and actions.

Main Results:

  • CGM demonstrated superior performance compared to state-of-the-art baseline methods on the D4RL dataset.
  • On the MuJoCo dataset, CGM outperformed the optimal comparison method by 2.89%.

Conclusions:

  • CGM effectively enhances trajectory stitching and deep multimodal interactions in offline RL.
  • The proposed approach shows significant improvements in action prediction accuracy and overall performance.