Model-Based Bayesian Reinforcement Learning in Large Structured Domains.

Stéphane Ross¹, Joelle Pineau¹

¹School of Computer Science, McGill University, Montreal, Canada.

Uncertainty in Artificial Intelligence : Proceedings of the ... Conference. Conference on Uncertainty in Artificial Intelligence

November 6, 2015

Summary

This summary is machine-generated.

Model-based Bayesian reinforcement learning (RL) offers a principled way to balance exploration and exploitation. This study introduces a scalable Bayesian framework that learns the structure and parameters of a dynamical system while simultaneously planning actions.

Area of Science:

Artificial Intelligence
Machine Learning
Reinforcement Learning

Background:

Model-based Bayesian reinforcement learning (RL) is of significant interest because it offers a principled way to handle the exploration-exploitation tradeoff.
Existing methods scale poorly because posterior inference over the model parameters becomes too complex as the domain grows.
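To make the idea of a posterior over model parameters concrete, here is a minimal sketch of a tabular Bayesian transition model. It assumes a small discrete state space and an independent Dirichlet prior for each (state, action) pair; the class name, the `prior_count` parameter, and the toy data are hypothetical illustrations and are not taken from the paper.

```python
from collections import defaultdict


class DirichletTransitionModel:
    """Independent Dirichlet posterior over next-state probabilities
    for each (state, action) pair. Illustrative assumption only."""

    def __init__(self, n_states, prior_count=1.0):
        self.n_states = n_states
        self.prior_count = prior_count  # symmetric Dirichlet prior pseudo-count
        # counts[(s, a)][s_next] = number of observed transitions
        self.counts = defaultdict(lambda: [0] * n_states)

    def update(self, s, a, s_next):
        """Record one observed transition (s, a) -> s_next."""
        self.counts[(s, a)][s_next] += 1

    def posterior_mean(self, s, a):
        """Posterior mean of the next-state distribution for (s, a)."""
        alphas = [self.prior_count + c for c in self.counts[(s, a)]]
        total = sum(alphas)
        return [alpha / total for alpha in alphas]


# Toy usage: three states, one action, three observed transitions from state 0.
model = DirichletTransitionModel(n_states=3)
for s_next in (1, 1, 2):
    model.update(0, 0, s_next)
mean = model.posterior_mean(0, 0)
print(mean)
```

Because this flat model keeps a separate count vector for every (state, action) pair, its size grows with the square of the number of states, which hints at why such approaches are hard to apply in large domains.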

Purpose of the Study:

To develop a scalable Bayesian framework for model-based reinforcement learning.
To simultaneously learn the structure and parameters of a dynamical system and plan actions.

Main Methods:

Utilized factored representations to manage model complexity.
Integrated online planning techniques with Bayesian inference.
Developed a novel Bayesian framework for joint learning and planning.
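The benefit of a factored representation can be illustrated by counting free parameters. The sketch below assumes binary state variables and a bound on how many variables each next-state variable depends on; both assumptions, and the function names, are hypothetical and are not details given in this summary.

```python
def flat_param_count(n_vars, n_actions):
    """Free parameters of a flat transition table over 2**n_vars states."""
    n_states = 2 ** n_vars
    # One categorical distribution (n_states - 1 free values) per (state, action).
    return n_states * n_actions * (n_states - 1)


def factored_param_count(n_vars, n_actions, max_parents):
    """Free parameters when each binary next-state variable depends on
    at most `max_parents` binary variables of the current state."""
    # One Bernoulli parameter per variable, action, and parent configuration.
    return n_vars * n_actions * (2 ** max_parents)


flat = flat_param_count(n_vars=10, n_actions=4)
factored = factored_param_count(n_vars=10, n_actions=4, max_parents=2)
print(flat, factored)
```

Under these assumptions the factored model needs orders of magnitude fewer parameters, so a posterior over it is much cheaper to maintain, provided the dependency structure is known or can itself be learned.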

Main Results:

The proposed framework improves the scalability of Bayesian reinforcement learning.
Demonstrated the ability to learn dynamical system structure and parameters.
Enabled simultaneous planning of near-optimal action sequences.
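Online planning can be sketched as a depth-limited lookahead from the current state. The example below is a generic expectimax search over a fixed toy model, not the paper's planner; in a Bayesian RL setting the `transition` function would be derived from the current posterior (for instance its mean or a sample), which is an assumption here. All names and the two-state domain are hypothetical.

```python
def q_value(state, action, depth, transition, reward, actions, gamma):
    """Expected discounted return of taking `action` in `state`, then acting
    greedily for the remaining depth - 1 steps."""
    q = reward(state, action)
    for s_next, prob in transition(state, action).items():
        q += gamma * prob * lookahead_value(
            s_next, depth - 1, transition, reward, actions, gamma
        )
    return q


def lookahead_value(state, depth, transition, reward, actions, gamma=0.95):
    """Depth-limited expectimax value of `state` under the given model."""
    if depth == 0:
        return 0.0
    return max(
        q_value(state, a, depth, transition, reward, actions, gamma)
        for a in actions
    )


def plan_action(state, depth, transition, reward, actions, gamma=0.95):
    """Return the action with the highest depth-limited lookahead value."""
    return max(
        actions,
        key=lambda a: q_value(state, a, depth, transition, reward, actions, gamma),
    )


# Toy two-state domain: "go" flips the state, "stay" keeps it.
ACTIONS = ("stay", "go")


def toy_transition(state, action):
    return {state: 1.0} if action == "stay" else {1 - state: 1.0}


def toy_reward(state, action):
    return 1.0 if (state == 1 and action == "stay") else 0.0


best_action = plan_action(0, 2, toy_transition, toy_reward, ACTIONS)
best_value = lookahead_value(0, 2, toy_transition, toy_reward, ACTIONS)
print(best_action, best_value)
```

The search cost grows exponentially with depth, which is why online planners of this kind usually limit the horizon and replan after every real step.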

Conclusions:

The developed Bayesian framework extends the practical applicability of model-based Bayesian RL to larger domains.
This approach offers a unified solution for learning and planning in dynamical systems.