Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Law of Effect

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...

Bootstrapping

Bootstrapping

The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...

Primary and Secondary Reinforcers

Primary and Secondary Reinforcers

In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...

Modeling in Therapy

Modeling in Therapy

Modeling, a key technique in therapy, uses observational learning to help clients acquire and practice new skills by watching therapists demonstrate desired behaviors. This approach, rooted in Albert Bandura's concept of vicarious learning, plays a significant role in therapeutic interventions for various psychological conditions, including social anxiety, ADHD, and depression.
Participant Modeling
Participant modeling involves therapists demonstrating calm and effective behaviors in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Auricular Acupressure for Preventing Postoperative Catheter-Related Bladder Discomfort in Male Patients Undergoing Spinal Surgery: A Randomized Controlled Trial.

Nursing research and practice·2026

Same author

Qualitative Analysis of User Experiences of a mHealth Self-Care Intervention for Care Partners of Individuals with Traumatic Brain Injury.

Archives of rehabilitation research and clinical translation·2026

Same author

FeS Colloids Trigger Antimony Redox Cycling, Colloid Formation, and Ultimate Fate during the Anoxic-Oxic Transition.

Environmental science & technology·2026

Same author

Impurity Profiling of an Antibody-Antibiotic Conjugate (AAC) Drug-Linker Using Multiple 2D-LC Modes.

Journal of separation science·2026

Same author

Crotonylation impedes c-Myc oncogenic activity.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same author

Common Otolaryngologic Clinical Manifestations and Symptoms of Langerhans Cell Histiocytosis and Evaluation of Treatment Regimens in Pediatric Patients: A Systematic Review and Meta-Analysis.

International journal of pediatric otorhinolaryngology·2026

Same journal

Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification.

Machine learning·2026

Same journal

Linear Causal Discovery with Interventional Constraints.

Machine learning·2026

Same journal

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models.

Machine learning·2025

Same journal

Mining exceptional social behavior on attributed interaction networks.

Machine learning·2025

Same journal

Persistent Laplacian-enhanced algorithm for scarcely labeled data classification.

Machine learning·2025

Same journal

Ensuring medical AI safety: interpretability-driven detection and mitigation of spurious model behavior and associated data.

Machine learning·2025

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 14, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling.

Susobhan Ghosh¹, Raphael Kim², Prasidh Chhabria³

¹Department of Computer Science, Harvard University.

Machine Learning

|September 2, 2024

Summary

This summary is machine-generated.

Reinforcement learning (RL) can personalize digital health treatments, but its effectiveness needs validation. This study introduces a method to confirm if RL personalization is genuine or an artifact of its inherent randomness.

Keywords:

Reinforcement learning exploratory data analysis mobile health personalization resampling

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

Related Experiment Videos

Last Updated: Jun 14, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

Area of Science:

Digital Health Interventions
Machine Learning in Healthcare
Behavioral Science

Background:

Growing interest in using reinforcement learning (RL) for personalized digital health treatments to promote healthier behaviors.
Sequential decision-making in digital health relies on user context for treatment timing and type.
Online RL offers a data-driven approach to personalize treatments based on user responses.

Purpose of the Study:

To assess the data evidence confirming that RL algorithms genuinely personalize treatments for users.
To investigate if observed RL personalization is an artifact of the algorithm's stochasticity.
To introduce a resampling-based methodology for evaluating RL personalization.

Main Methods:

Developed a working definition of personalization for RL algorithms.
Introduced a resampling-based methodology to distinguish genuine personalization from algorithmic stochasticity.
Applied the methodology to data from the HeartSteps physical activity clinical trial.

Main Results:

Demonstrated a method to rigorously evaluate the personalization capabilities of online RL algorithms.
The case study illustrated how the approach enhances the assessment of algorithm personalization.
Personalization was evaluated both across all users and within individual user data.

Conclusions:

The proposed methodology provides a framework for data-driven truth-in-advertising of RL personalization in digital health.
Validating RL personalization is crucial for the ethical and effective deployment of optimized digital health interventions.
This approach helps ensure that RL algorithms provide meaningful, individualized support for behavior change.