Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Randomized Experiments

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...

Expected Value

Expected Value

The expected value is known as the "long-term" average or mean. This means that over the long term of experimenting over and over, you would expect this average. The expected average is represented by the symbol μ. It is calculated as follows:

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Fast reconstruction of degenerate populations of conductance-based neuron models from spike times.

PLoS computational biology·2026

Same author

Launching Austria's One Health network: paving the way for transdisciplinary collaborations.

One health outlook·2024

Same author

Warming up recurrent neural networks to maximise reachable multistability greatly improves learning.

Neural networks : the official journal of the International Neural Network Society·2023

Same author

Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments.

Sensors (Basel, Switzerland)·2022

Same author

A bio-inspired bistable recurrent cell allows for long-lasting memory.

PloS one·2021

Same author

The impact of different COVID-19 containment measures on electricity consumption in Europe.

Energy research & social science·2020

Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026

Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026

Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026

Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026

Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026

Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 19, 2026

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Benchmarking for Bayesian Reinforcement Learning.

Michael Castronovo¹, Damien Ernst¹, Adrien Couëtoux¹

¹Systems and Modeling, Montefiore Institute, University of Liege, Liege, Belgium.

|June 16, 2016

Summary

This summary is machine-generated.

This study introduces a new methodology and open-source library for comparing Bayesian Reinforcement Learning (BRL) algorithms. It addresses limitations of existing benchmarks by evaluating performance across diverse Markov Decision Processes (MDPs) and analyzing computational time.

More Related Videos

Three Laboratory Procedures for Assessing Different Manifestations of Impulsivity in Rats

Three Laboratory Procedures for Assessing Different Manifestations of Impulsivity in Rats

Published on: March 17, 2019

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Related Experiment Videos

Last Updated: Mar 19, 2026

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Three Laboratory Procedures for Assessing Different Manifestations of Impulsivity in Rats

Three Laboratory Procedures for Assessing Different Manifestations of Impulsivity in Rats

Published on: March 17, 2019

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Area of Science:

Artificial Intelligence
Machine Learning
Computational Neuroscience

Background:

Bayesian Reinforcement Learning (BRL) agents leverage prior knowledge to maximize rewards.
Existing benchmarks for BRL algorithms are often limited in scope and applicability.
A standardized and comprehensive comparison methodology is needed for BRL algorithm development.

Purpose of the Study:

To address the limitations of current BRL algorithm benchmarks.
To introduce a novel methodology for comparing BRL algorithms.
To provide an open-source library for facilitating reproducible BRL research.

Main Methods:

Defined a comparison criterion evaluating algorithm performance on large sets of Markov Decision Processes (MDPs) sampled from probability distributions.
Incorporated analysis of computation time requirements to enable comparison of non-anytime algorithms.
Developed an open-source library including test problems, prior distributions, and state-of-the-art BRL algorithms.

Main Results:

The developed methodology provides a robust framework for evaluating BRL algorithms.
The open-source library facilitates standardized benchmarking and reproducible research.
Comparative analysis of seven state-of-the-art algorithms was performed and results were discussed.

Conclusions:

The new methodology and library significantly advance the field of Bayesian Reinforcement Learning.
This work enables more reliable and comprehensive evaluation of BRL algorithms.
The open-source release promotes wider adoption and further development in BRL research.