Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Associative Learning01:27

Associative Learning

2.1K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
2.1K
Reinforcement01:23

Reinforcement

1.2K
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
1.2K
Avoidance Learning and Learned Helplessness01:14

Avoidance Learning and Learned Helplessness

4.2K
Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...
4.2K
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

1.8K
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
1.8K
Reinforcement Schedules01:24

Reinforcement Schedules

743
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
743
Observational Learning01:12

Observational Learning

1.5K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Statistics and AI - A Fireside Conversation.

Harvard data science reviewยท2026
Same authorSame journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Associationยท2026
Same author

Predicting the timing of first sustained cognitive worsening in Alzheimer's disease using real-world clinical data and machine learning.

medRxiv : the preprint server for health sciencesยท2026
Same author

Nonparametric estimation of the total treatment effect with multiple outcomes in the presence of terminal events.

Biometricsยท2026
Same author

Scalable Gaussian process regression via median posterior inference for estimating the health effects of an environmental mixture.

Biometricsยท2026
Same author

Large-scale antibody reactome profiling identifies herpesvirus-autoantigen associations underlying chronic diseases.

Research squareยท2026
Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Associationยท2026
Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Associationยท2026
Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Associationยท2026
Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Associationยท2026
Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Associationยท2026
See all related articles

Related Experiment Video

Updated: May 5, 2026

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

6.2K

Federated Offline Reinforcement Learning.

Doudou Zhou1, Yufeng Zhang2, Aaron Sonabend-W1

  • 1Department of Biostatistics, Harvard T.H. Chan School of Public Health.

Journal of the American Statistical Association
|February 20, 2026
PubMed
Summary
This summary is machine-generated.

Federated offline reinforcement learning (RL) enables personalized medicine using distributed healthcare data. This new algorithm optimizes treatment policies efficiently across multiple sites, achieving performance comparable to centralized data.

Keywords:
dynamic treatment regimeselectrical health recordsmulti-source learning

Related Experiment Videos

Last Updated: May 5, 2026

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

6.2K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Healthcare Informatics

Background:

  • Personalized medicine requires dynamic treatment regimes, often leveraging offline reinforcement learning (RL).
  • Sharing sensitive healthcare data across institutions is restricted due to privacy concerns and site-specific data heterogeneity.
  • Existing methods struggle to utilize distributed datasets effectively for developing robust treatment strategies.

Purpose of the Study:

  • To develop a novel federated offline RL framework addressing privacy and heterogeneity in multi-site healthcare data.
  • To enable the analysis of site-level features within a unified model.
  • To design a communication-efficient algorithm for optimizing dynamic treatment regimes.

Main Methods:

  • Proposed a multi-site Markov decision process model accommodating both homogeneous and heterogeneous site effects.
  • Developed the first federated policy optimization algorithm for offline RL with guaranteed sample complexity.
  • Algorithm requires only a single round of communication via summary statistics exchange.

Main Results:

  • The proposed federated offline RL algorithm demonstrates theoretical guarantees on policy suboptimality, comparable to centralized data scenarios.
  • Extensive simulations confirm the algorithm's effectiveness in learning optimal policies.
  • The method was successfully applied to a multi-site sepsis dataset.

Conclusions:

  • Federated offline RL is a viable approach for personalized medicine with distributed, private healthcare data.
  • The proposed algorithm offers an efficient and effective solution for multi-site treatment regime optimization.
  • This work facilitates the clinical application of advanced RL techniques in real-world healthcare settings.