Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

190
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
190
Reinforcement Schedules01:24

Reinforcement Schedules

138
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
138
Law of Effect01:06

Law of Effect

1.3K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.3K
Bootstrapping01:24

Bootstrapping

592
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
592
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

218
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
218
Modeling in Therapy01:26

Modeling in Therapy

59
Modeling, a key technique in therapy, uses observational learning to help clients acquire and practice new skills by watching therapists demonstrate desired behaviors. This approach, rooted in Albert Bandura's concept of vicarious learning, plays a significant role in therapeutic interventions for various psychological conditions, including social anxiety, ADHD, and depression.
Participant Modeling
Participant modeling involves therapists demonstrating calm and effective behaviors in...
59

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Auricular Acupressure for Preventing Postoperative Catheter-Related Bladder Discomfort in Male Patients Undergoing Spinal Surgery: A Randomized Controlled Trial.

Nursing research and practice·2026
Same author

Qualitative Analysis of User Experiences of a mHealth Self-Care Intervention for Care Partners of Individuals with Traumatic Brain Injury.

Archives of rehabilitation research and clinical translation·2026
Same author

FeS Colloids Trigger Antimony Redox Cycling, Colloid Formation, and Ultimate Fate during the Anoxic-Oxic Transition.

Environmental science & technology·2026
Same author

Impurity Profiling of an Antibody-Antibiotic Conjugate (AAC) Drug-Linker Using Multiple 2D-LC Modes.

Journal of separation science·2026
Same author

Crotonylation impedes c-Myc oncogenic activity.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Common Otolaryngologic Clinical Manifestations and Symptoms of Langerhans Cell Histiocytosis and Evaluation of Treatment Regimens in Pediatric Patients: A Systematic Review and Meta-Analysis.

International journal of pediatric otorhinolaryngology·2026
Same journal

Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification.

Machine learning·2026
Same journal

Linear Causal Discovery with Interventional Constraints.

Machine learning·2026
Same journal

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models.

Machine learning·2025
Same journal

Mining exceptional social behavior on attributed interaction networks.

Machine learning·2025
Same journal

Persistent Laplacian-enhanced algorithm for scarcely labeled data classification.

Machine learning·2025
Same journal

Ensuring medical AI safety: interpretability-driven detection and mitigation of spurious model behavior and associated data.

Machine learning·2025
See all related articles

Related Experiment Video

Updated: Jun 14, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

4.3K

Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling.

Susobhan Ghosh1, Raphael Kim2, Prasidh Chhabria3

  • 1Department of Computer Science, Harvard University.

Machine Learning
|September 2, 2024
PubMed
Summary
This summary is machine-generated.

Reinforcement learning (RL) can personalize digital health treatments, but its effectiveness needs validation. This study introduces a method to confirm if RL personalization is genuine or an artifact of its inherent randomness.

Keywords:
Reinforcement learningexploratory data analysismobile healthpersonalizationresampling

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K
Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.5K

Related Experiment Videos

Last Updated: Jun 14, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

4.3K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K
Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.5K

Area of Science:

  • Digital Health Interventions
  • Machine Learning in Healthcare
  • Behavioral Science

Background:

  • Growing interest in using reinforcement learning (RL) for personalized digital health treatments to promote healthier behaviors.
  • Sequential decision-making in digital health relies on user context for treatment timing and type.
  • Online RL offers a data-driven approach to personalize treatments based on user responses.

Purpose of the Study:

  • To assess the data evidence confirming that RL algorithms genuinely personalize treatments for users.
  • To investigate if observed RL personalization is an artifact of the algorithm's stochasticity.
  • To introduce a resampling-based methodology for evaluating RL personalization.

Main Methods:

  • Developed a working definition of personalization for RL algorithms.
  • Introduced a resampling-based methodology to distinguish genuine personalization from algorithmic stochasticity.
  • Applied the methodology to data from the HeartSteps physical activity clinical trial.

Main Results:

  • Demonstrated a method to rigorously evaluate the personalization capabilities of online RL algorithms.
  • The case study illustrated how the approach enhances the assessment of algorithm personalization.
  • Personalization was evaluated both across all users and within individual user data.

Conclusions:

  • The proposed methodology provides a framework for data-driven truth-in-advertising of RL personalization in digital health.
  • Validating RL personalization is crucial for the ethical and effective deployment of optimized digital health interventions.
  • This approach helps ensure that RL algorithms provide meaningful, individualized support for behavior change.