Federated Offline Reinforcement Learning.

Doudou Zhou¹, Yufeng Zhang², Aaron Sonabend-W¹

¹Department of Biostatistics, Harvard T.H. Chan School of Public Health.

Journal of the American Statistical Association

February 20, 2026

Summary

This summary is machine-generated.

Federated offline reinforcement learning (RL) supports personalized medicine by learning from healthcare data distributed across institutions. The proposed algorithm optimizes treatment policies across multiple sites with little communication, achieving performance comparable to training on centralized data.

Keywords:

dynamic treatment regimes; electronic health records; multi-source learning

Area of Science:

Artificial Intelligence
Machine Learning
Healthcare Informatics

Background:

Personalized medicine requires dynamic treatment regimes, often leveraging offline reinforcement learning (RL).
Sharing sensitive healthcare data across institutions is restricted by privacy concerns, and site-specific data heterogeneity complicates combining data from different sites.
Existing methods struggle to utilize distributed datasets effectively for developing robust treatment strategies.

Purpose of the Study:

To develop a novel federated offline RL framework addressing privacy and heterogeneity in multi-site healthcare data.
To enable the analysis of site-level features within a unified model.
To design a communication-efficient algorithm for optimizing dynamic treatment regimes.

Main Methods:

Proposed a multi-site Markov decision process model accommodating both homogeneous and heterogeneous site effects.
Developed the first federated policy optimization algorithm for offline RL with guaranteed sample complexity.
The algorithm requires only a single round of communication, in which sites exchange summary statistics.
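
To illustrate what a single round of summary-statistics exchange can look like, here is a minimal sketch. It is not the paper's algorithm: it swaps in plain ridge regression as a stand-in, and the function names (`local_summary`, `aggregate`), the simulated sites, and the `ridge` parameter are all hypothetical. Each site sends only aggregated matrices, never patient-level rows, and the coordinator recovers the same estimate it would get from pooled data.

```python
import numpy as np

def local_summary(X, y):
    """Run at each site (hypothetical helper): only these aggregates leave the site."""
    return X.T @ X, X.T @ y

def aggregate(summaries, ridge=1e-3):
    """Run once at the coordinator: combine site summaries and solve ridge regression."""
    gram = sum(s[0] for s in summaries)      # sum of X^T X over sites
    moment = sum(s[1] for s in summaries)    # sum of X^T y over sites
    d = gram.shape[0]
    return np.linalg.solve(gram + ridge * np.eye(d), moment)

# Simulated data for three sites (an assumption for illustration only).
rng = np.random.default_rng(0)
true_w = np.array([1.0, -2.0, 0.5])
sites = []
for n in (200, 350, 120):
    X = rng.normal(size=(n, 3))
    y = X @ true_w + 0.1 * rng.normal(size=n)
    sites.append((X, y))

# One communication round: each site sends its summary, the coordinator aggregates.
w_fed = aggregate([local_summary(X, y) for X, y in sites])

# Reference: the same estimator fitted on pooled, centralized data.
X_all = np.vstack([X for X, _ in sites])
y_all = np.concatenate([y for _, y in sites])
w_pooled = np.linalg.solve(X_all.T @ X_all + 1e-3 * np.eye(3), X_all.T @ y_all)

print(np.allclose(w_fed, w_pooled))
```

In this toy setting the federated and pooled estimates coincide up to floating-point error because the sufficient statistics are additive across sites. The paper's setting additionally handles heterogeneous site effects and policy optimization, which this sketch does not attempt.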

Main Results:

The proposed federated offline RL algorithm has theoretical guarantees on policy suboptimality that are comparable to those obtainable when the data are centralized.
Extensive simulations confirm the algorithm's effectiveness in learning optimal policies.
The method was successfully applied to a multi-site sepsis dataset.

Conclusions:

Federated offline RL is a viable approach for personalized medicine with distributed, private healthcare data.
The proposed algorithm offers an efficient and effective solution for multi-site treatment regime optimization.
This work facilitates the clinical application of advanced RL techniques in real-world healthcare settings.