Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Decision Making01:20

Decision Making

101
Decision-making is a fundamental cognitive process that involves evaluating alternatives and selecting among them. This process can range from simple choices, such as deciding what to wear, to complex decisions, like choosing a major in college or a career path. The complexity of the decision often dictates the approach we use, which can be broadly categorized into two types: automatic and controlled decision-making.
Automatic decision-making is fast, intuitive, and relies on gut feelings...
101
Timing and Consequences on Behavior01:08

Timing and Consequences on Behavior

87
In operant conditioning, the timing of reinforcement is crucial. For animals like rats and cats, immediate reinforcement (within a few seconds) is much more effective than delayed reinforcement. For example, a food reward for a rat needs to follow within 30 seconds of pressing a bar to be effective. 
Humans, however, can respond to delayed reinforcers. We often make decisions between immediate small rewards and delayed larger rewards. This ability to delay gratification is a significant...
87
Reinforcement Schedules01:24

Reinforcement Schedules

138
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
138
Reinforcement01:23

Reinforcement

190
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
190
Law of Effect01:06

Law of Effect

1.3K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.3K
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

490
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
490

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrative learning of individualized treatment rules from multiple studies with partially overlapping treatments.

Biometrics·2026
Same author

SEMIPARAMETRIC ANALYSIS OF INTERVAL-CENSORED DATA SUBJECT TO INACCURATE DIAGNOSES WITH A TERMINAL EVENT.

The annals of applied statistics·2026
Same author

DYNAMIC CLASSIFICATION OF LATENT DISEASE PROGRESSION WITH AUXILIARY SURROGATE LABELS.

The annals of applied statistics·2026
Same author

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection.

Journal of machine learning research : JMLR·2026
Same author

Data fusion methods for the heterogeneity of treatment effect and confounding function.

Bernoulli : official journal of the Bernoulli Society for Mathematical Statistics and Probability·2026
Same author

Leveraging precision medicine analytics to optimize inflammation reduction and enhance physical function in older adults.

The journals of gerontology. Series A, Biological sciences and medical sciences·2026
Same journal

A Bayesian functional concurrent zero-inflated Dirichlet-multinomial regression model with application to infant microbiome.

Biostatistics (Oxford, England)·2026
Same journal

Towards optimal environmental policies: policy learning under arbitrary bipartite network interference.

Biostatistics (Oxford, England)·2026
Same journal

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026
Same journal

Adaptive transfer learning for time-to-event modeling with applications in disease risk assessment.

Biostatistics (Oxford, England)·2026
Same journal

High-dimensional test for one-sided hypotheses.

Biostatistics (Oxford, England)·2026
Same journal

NBSR: a Negative Binomial Softmax Regression model for microRNA-seq data analysis.

Biostatistics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Jun 14, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
07:05

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

5.9K

HMM for discovering decision-making dynamics using reinforcement learning experiments.

Xingche Guo1, Donglin Zeng2, Yuanjia Wang1,3

  • 1Department of Biostatistics, Columbia University, 722 West 168th St, New York, NY, 10032, United States.

Biostatistics (Oxford, England)
|September 3, 2024
PubMed
Summary
This summary is machine-generated.

Major depressive disorder (MDD) patients show altered reward learning strategies. A new model reveals MDD individuals engage less in reinforcement learning (RL) compared to controls, impacting decision-making.

Keywords:
behavioral phenotypingbrain–behavior associationmental healthreinforcement learningreward tasksstate-switching

More Related Videos

An Automated T-maze Based Apparatus and Protocol for Analyzing Delay- and Effort-based Decision Making in Free Moving Rodents
07:42

An Automated T-maze Based Apparatus and Protocol for Analyzing Delay- and Effort-based Decision Making in Free Moving Rodents

Published on: August 2, 2018

13.5K
A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents
06:25

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

Published on: May 16, 2025

114

Related Experiment Videos

Last Updated: Jun 14, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
07:05

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

5.9K
An Automated T-maze Based Apparatus and Protocol for Analyzing Delay- and Effort-based Decision Making in Free Moving Rodents
07:42

An Automated T-maze Based Apparatus and Protocol for Analyzing Delay- and Effort-based Decision Making in Free Moving Rodents

Published on: August 2, 2018

13.5K
A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents
06:25

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

Published on: May 16, 2025

114

Area of Science:

  • Computational psychiatry
  • Neuroscience
  • Behavioral economics

Background:

  • Major depressive disorder (MDD) is a leading cause of disability, with diagnosis and treatment complicated by its heterogeneity.
  • Abnormalities in reward processing are increasingly recognized as potential behavioral markers for MDD.
  • Traditional reinforcement learning (RL) models may not fully capture the complexity of decision-making in MDD, suggesting strategy switching.

Purpose of the Study:

  • To investigate how decision-making strategy dynamics influence reward learning in individuals with MDD.
  • To propose and validate a novel computational framework for analyzing reward-based decision-making in MDD.

Main Methods:

  • Developed a novel reinforcement learning-hidden Markov model (RL-HMM) framework to analyze reward-based decision-making.
  • The RL-HMM accommodates strategy switching between RL-based choices and random choices, with a continuous RL state space and time-varying transitions.
  • Employed an efficient Expectation-maximization (EM) algorithm for parameter estimation and nonparametric bootstrap for statistical inference.

Main Results:

  • The RL-HMM framework demonstrated robust performance in extensive simulation studies.
  • Application to the Establishing Moderators and Biosignatures of Antidepressant Response in Clinical Care (EMBARC) study revealed reduced RL engagement in MDD patients compared to healthy controls.
  • Lower RL engagement in MDD was associated with brain activity in negative affect circuitry during an emotional conflict task.

Conclusions:

  • The proposed RL-HMM framework effectively models strategy switching in reward-based decision-making.
  • MDD is characterized by altered reward learning dynamics, specifically reduced engagement in reinforcement learning.
  • These findings link reward processing abnormalities in MDD to neural activity in affective circuits.