Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Observational Learning01:12

Observational Learning

1.2K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.2K
Reinforcement Schedules01:24

Reinforcement Schedules

690
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
690
Reinforcement01:23

Reinforcement

1.1K
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
1.1K
Associative Learning01:27

Associative Learning

1.9K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
1.9K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

397
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
397
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

484
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
484

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Estimating causal effects with optimization-based methods: A review and empirical comparison.

European journal of operational research·2025
Same author

Advancing science- and evidence-based AI policy.

Science (New York, N.Y.)·2025
Same author

A flexible machine learning Mendelian randomization estimator applied to predict the safety and efficacy of sclerostin inhibition.

American journal of human genetics·2025
Same author

Circulating proteins to predict COVID-19 severity.

Scientific reports·2023
Same author

Biomedical Research and Informatics Living Laboratory for Innovative Advances of New Technologies in Community Mobility Rehabilitation: Protocol for Evaluation and Rehabilitation of Mobility Across Continuums of Care.

JMIR research protocols·2022
Same author

COVID-19 Prognosis via Self-Supervised Representation Learning and Multi-Image Prediction.

ArXiv·2021
Same journal

Calibrated and Conformal Propensity Scores for Causal Effect Estimation.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2025
Same journal

Non-stationary Domain Generalization: Theory and Algorithm.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2025
Same journal

Offline Reward Perturbation Boosts Distributional Shift in Online RL.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2024
Same journal

A Variational Approximation for Analyzing the Dynamics of Panel Data.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2021
Same journal

Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2021
Same journal

Sampling-free Uncertainty Estimation in Gated Recurrent Units with Applications to Normative Modeling in Neuroimaging.

Uncertainty in artificial intelligence : proceedings of the ... conference. Conference on Uncertainty in Artificial Intelligence·2020
See all related articles

Related Experiment Video

Updated: Mar 30, 2026

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

10.0K

Model-Based Bayesian Reinforcement Learning in Large Structured Domains.

Stéphane Ross1, Joelle Pineau1

  • 1School of Computer Science, McGill University, Montreal, Canada.

Uncertainty in Artificial Intelligence : Proceedings of the ... Conference. Conference on Uncertainty in Artificial Intelligence
|November 6, 2015
PubMed
Summary
This summary is machine-generated.

Model-based Bayesian reinforcement learning (RL) offers optimal exploration-exploitation solutions. This study introduces a scalable Bayesian framework for learning dynamical systems and planning actions simultaneously.

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.3K

Related Experiment Videos

Last Updated: Mar 30, 2026

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

10.0K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.3K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Reinforcement Learning

Background:

  • Model-based Bayesian reinforcement learning (RL) is of significant interest for addressing the exploration-exploitation tradeoff.
  • Current methods face scalability limitations due to the complexity of posterior inference over model parameters.

Purpose of the Study:

  • To develop a scalable Bayesian framework for model-based reinforcement learning.
  • To simultaneously learn the structure and parameters of a dynamical system and plan actions.

Main Methods:

  • Utilized factored representations to manage model complexity.
  • Integrated online planning techniques with Bayesian inference.
  • Developed a novel Bayesian framework for joint learning and planning.

Main Results:

  • The proposed framework improves the scalability of Bayesian reinforcement learning.
  • Demonstrated the ability to learn dynamical system structure and parameters.
  • Enabled simultaneous planning of near-optimal action sequences.

Conclusions:

  • The developed Bayesian framework enhances the practical applicability of model-based RL in larger domains.
  • This approach offers a unified solution for learning and planning in dynamical systems.