Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

313
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
313
Observational Learning01:12

Observational Learning

285
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
285
Reinforcement Schedules01:24

Reinforcement Schedules

231
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
231
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

748
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
748
Introduction to Learning01:18

Introduction to Learning

519
Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...
519
Associative Learning01:27

Associative Learning

533
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
533

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A lipid droplet-targeted fluorescent probe for selective imaging of cysteine fluctuations in drug-induced liver injury.

Analytical methods : advancing methods and applications·2026
Same author

In situ dynamic modulation of zero-valent and low-valent copper ratio for constructing stable copper catalysts for acetylene hydrochlorination.

Journal of colloid and interface science·2026
Same author

A case report of synchronous bilateral breast cancer with distinct histological subtypes and favorable long-term survival.

Frontiers in oncology·2026
Same author

Spatial-cellular resolution analysis of ferroptosis-associated immune microenvironment heterogeneity in osteoarthritic cartilage.

Immunobiology·2026
Same author

The impact of drought stress on the physiological biochemical indexes and metabolites in Panax notoginseng.

Scientific reports·2026
Same author

An interface-confined ultrabright AIE nanoparticle-enhanced lateral flow immunoassay platform for full-range and accurate CRP detection.

Talanta·2026
Same journal

Granular Ball-Based Noise-Resistant Fuzzy Multineighborhood Feature Selection via Label Enhancement and Feature Graph.

IEEE transactions on neural networks and learning systems·2026
Same journal

Fighting Evolving Spam With ARTMAP Models: A Noise-Resilient Online Detection Framework.

IEEE transactions on neural networks and learning systems·2026
Same journal

HyperSAT: Unsupervised Hypergraph Neural Networks for Weighted MaxSAT Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

Negation of Basic Belief Assignment in Multisource Information Fusion on Dempster-Shafer Theory With Applications in Pattern Classification.

IEEE transactions on neural networks and learning systems·2026
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Aug 27, 2025

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.5K

Deep Reinforcement Learning: A Survey.

Xu Wang, Sen Wang, Xingxing Liang

    IEEE Transactions on Neural Networks and Learning Systems
    |September 28, 2022
    PubMed
    Summary
    This summary is machine-generated.

    Deep reinforcement learning (DRL) combines deep learning and reinforcement learning for advanced control. This review covers DRL theories, algorithms, and challenges like limited samples and multi-agent systems.

    More Related Videos

    The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents
    09:01

    The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

    Published on: July 8, 2015

    12.7K
    An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice
    08:59

    An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

    Published on: March 3, 2023

    2.2K

    Related Experiment Videos

    Last Updated: Aug 27, 2025

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.5K
    The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents
    09:01

    The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

    Published on: July 8, 2015

    12.7K
    An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice
    08:59

    An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

    Published on: March 3, 2023

    2.2K

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Deep Reinforcement Learning

    Background:

    • Deep reinforcement learning (DRL) merges deep learning's feature representation with reinforcement learning's decision-making for end-to-end control.
    • DRL has advanced significantly in tasks with high-dimensional inputs and optimal decision-making over the past decade.
    • Challenges persist in DRL, particularly in sample-limited, sparse-reward, and multi-agent learning control tasks.

    Purpose of the Study:

    • To provide a comprehensive overview of the fundamental theories, key algorithms, and primary research domains of DRL.
    • To summarize advances in value-based, policy-based, and maximum entropy-based DRL algorithms.
    • To analyze and discuss future research directions in DRL.

    Main Methods:

    • Review of existing literature on Deep Reinforcement Learning.
    • Categorization and summarization of DRL algorithms (value-based, policy-based, maximum entropy-based).
    • Analysis of current challenges and future research trends in DRL.

    Main Results:

    • Significant advances in DRL have been observed across various complex tasks.
    • Various solutions and theories have been proposed to address DRL's inherent challenges.
    • Deep learning has spurred advancements in subfields like hierarchical and multi-agent reinforcement learning.

    Conclusions:

    • DRL offers powerful capabilities for learning control from high-dimensional data.
    • Continued research is essential to overcome limitations in sample efficiency, reward sparsity, and multi-agent coordination.
    • The field is poised for further development, driven by deep learning integration and exploration of new algorithmic approaches.