Updated: Aug 27, 2025

Deep Reinforcement Learning: A Survey.

Xu Wang, Sen Wang, Xingxing Liang

IEEE Transactions on Neural Networks and Learning Systems

September 28, 2022

Summary

This summary is machine-generated.

Deep reinforcement learning (DRL) combines deep learning with reinforcement learning to learn control end to end. This review covers DRL theory, algorithms, and open challenges such as limited samples, sparse rewards, and multi-agent systems.

Area of Science:

Artificial Intelligence
Machine Learning
Deep Reinforcement Learning

Background:

Deep reinforcement learning (DRL) merges deep learning's feature representation with reinforcement learning's decision-making for end-to-end control.
Over the past decade, DRL has advanced significantly on tasks that require optimal decision-making from high-dimensional inputs.
Challenges persist in DRL, particularly in sample-limited, sparse-reward, and multi-agent learning control tasks.

Purpose of the Study:

To provide a comprehensive overview of the fundamental theories, key algorithms, and primary research domains of DRL.
To summarize advances in value-based, policy-based, and maximum entropy-based DRL algorithms.
To analyze and discuss future research directions in DRL.
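
For orientation, the three algorithm families named above are commonly written as follows. These are standard textbook forms, not formulas quoted from the survey, and the notation is an assumption made here: state $s_t$, action $a_t$, reward $r_t$, discount $\gamma$, step size $\eta$, policy parameters $\theta$, and entropy temperature $\alpha$.

```latex
% Value-based: one-step Q-learning update toward the Bellman target
Q(s_t, a_t) \leftarrow Q(s_t, a_t)
  + \eta \Big[ r_t + \gamma \max_{a'} Q(s_{t+1}, a') - Q(s_t, a_t) \Big]

% Policy-based: policy-gradient form of the objective's gradient
\nabla_\theta J(\theta)
  = \mathbb{E}_{\pi_\theta}\Big[ \nabla_\theta \log \pi_\theta(a_t \mid s_t)\,
      Q^{\pi_\theta}(s_t, a_t) \Big]

% Maximum entropy-based: return augmented with a policy-entropy bonus
J(\pi) = \mathbb{E}_{\pi}\Big[ \sum_{t} \gamma^{t}
  \big( r(s_t, a_t) + \alpha\, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \big) \Big]
```

In deep variants, $Q$ or $\pi_\theta$ is typically represented by a neural network, which is where the deep-learning half of DRL enters.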

Main Methods:

Review of the existing DRL literature.
Categorization and summarization of DRL algorithms (value-based, policy-based, maximum entropy-based).
Analysis of current challenges and future research trends in DRL.
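
To make the value-based category concrete, here is a minimal tabular sketch in Python. It is not taken from the survey: the five-state chain environment, the function names `q_learning_chain` and `greedy_policy`, and all hyperparameters are hypothetical choices for illustration. The reward is sparse (1 only on reaching the last state), which loosely mirrors the sparse-reward setting mentioned in the summary. A deep variant would replace the table with a neural network.

```python
import random


def q_learning_chain(n_states=5, episodes=500, alpha=0.5,
                     gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning on a hypothetical chain: states 0..n_states-1,
    actions 0 (left) and 1 (right), reward 1.0 only on reaching the last state."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # cap on steps per episode
            if rng.random() < epsilon:
                a = rng.randrange(2)  # explore
            else:
                best = max(q[s])      # exploit, breaking ties at random
                a = rng.choice([i for i in range(2) if q[s][i] == best])
            s_next = max(0, s - 1) if a == 0 else s + 1
            done = s_next == goal
            r = 1.0 if done else 0.0
            # One-step target: bootstrap from the next state unless terminal.
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            if done:
                break
    return q


def greedy_policy(q):
    """Greedy action for every non-terminal state."""
    return [max(range(2), key=lambda a: q[s][a]) for s in range(len(q) - 1)]


if __name__ == "__main__":
    q_table = q_learning_chain()
    print(greedy_policy(q_table))
```

With these settings the learned greedy policy should move right in every non-terminal state, and the value of the first state's right action should approach 0.9 cubed, about 0.729.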

Main Results:

DRL has advanced significantly across a range of complex tasks.
Various solutions and theories have been proposed to address DRL's inherent challenges.
Deep learning has spurred advancements in subfields like hierarchical and multi-agent reinforcement learning.

Conclusions:

DRL offers powerful capabilities for learning control from high-dimensional data.
Continued research is essential to overcome limitations in sample efficiency, reward sparsity, and multi-agent coordination.
The field is poised for further development, driven by deep learning integration and exploration of new algorithmic approaches.