Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Higher Mental Functions of Brain: Learning and Memory

Higher Mental Functions of Brain: Learning and Memory

Memory is one of the most vital higher mental functions of the brain. Memory is closely related to learning because it enables us to retain information and experiences from our past to use them in our present life. It also helps us to remember facts, events, and skills, such as riding a bike or swimming. There are two types of memory — declarative memory, which involves memorizing facts or events, and procedural memory, which enables us to remember how to do something like writing or...

Reinforcements in Concrete

Reinforcements in Concrete

Reinforced concrete is a composite material used extensively in construction, combining the compressive strength of concrete with the tensile strength of steel. This synergy is essential as concrete, while excellent at resisting compression, is weak under tension. Steel bars, or rebars, are embedded in the concrete to handle these tensile forces. The choice of steel is strategic; it shares a similar coefficient of thermal expansion with concrete, which ensures uniformity in response to...

Corrosion of Reinforcement

Corrosion of Reinforcement

The corrosion of steel reinforcement within concrete is a process influenced by the material's inherent properties and external factors. The high pH level of around 13, provided by calcium hydroxide present in concrete, initially protects the steel reinforcement by promoting the formation of a passive iron oxide layer on its surface.
However, over time and under certain conditions like carbonation, chloride ingress, and cracking this protective state can be compromised. Steel has areas with...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

What are Estimates?

What are Estimates?

It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates.
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Template-Based Label Propagation for Mouse Brain MRI Skull Stripping.

Neuroinformatics·2026

Same author

CRTC1 knockdown in the marmoset visual cortex induces neuronal IEG overexpression, HFOs, and neurodegeneration.

Neuroscience research·2026

Same author

Brain/MINDS Marmoset Brain Atlas 2.0: Population Cortical Parcellation With Multi-Modal Templates.

Scientific data·2026

Same author

Blaming luck, claiming skill: Self-attribution bias in error assignment.

PLoS computational biology·2025

Same author

Decoding Confidence in Future Event: EEG Markers of Prospective Confidence in Perceptual and Memory Tasks.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same author

Neural mechanisms of individual differences in prior weight during scene recognition.

NeuroImage·2025

Same journal

Strategic Ability Updating in Concurrent Games by Coalitional Commitment.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2015

Same journal

Meta-Analysis of the First Facial Expression Recognition Challenge.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012

Same journal

Adjustable model-based fusion method for multispectral and panchromatic images.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012

Same journal

Face Feature Weighted Fusion Based on Fuzzy Membership Degree for Video Face Recognition.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012

Same journal

A New Adaptive Fast Cellular Automaton Neighborhood Detection and Rule Identification Algorithm.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012

Same journal

Human-arm-and-hand-dynamic model with variability analyses for a stylus-based haptic interface.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 7, 2026

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces

Published on: September 11, 2018

Incremental state aggregation for value function estimation in reinforcement learning.

Takeshi Mori¹, Shin Ishii

¹Institute of Perception, Action and Behaviour, School of Informatics, The University of Edinburgh, Edinburgh, UK. takeshi.mori@ed.ac.uk

IEEE Transactions on Systems, Man, and Cybernetics. Part B, Cybernetics : a Publication of the IEEE Systems, Man, and Cybernetics Society

|June 3, 2011

Summary

This summary is machine-generated.

This study introduces an efficient reinforcement learning method for estimating value functions in large state spaces. By decomposing problems into smaller parts, it significantly reduces computational cost compared to existing techniques.

More Related Videos

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Published on: November 12, 2008

Related Experiment Videos

Last Updated: Feb 7, 2026

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces

Published on: September 11, 2018

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Published on: November 12, 2008

Area of Science:

Artificial Intelligence
Machine Learning
Reinforcement Learning

Background:

Estimating value functions in reinforcement learning is challenging due to large state and action spaces.
Basis functions are commonly used but require significant prior knowledge for preparation.
Existing adaptive basis function methods are computationally expensive.

Purpose of the Study:

To propose an efficient method for approximating value functions in reinforcement learning.
To overcome the computational cost associated with adaptive basis function construction.
To reduce the difficulty of preparing basis functions in reinforcement learning.

Main Methods:

Decomposing the value function approximation problem into smaller subproblems.
Solving each subproblem with reduced computational cost.
Utilizing a linear combination of basis functions for value function representation.

Main Results:

The proposed method significantly reduces CPU time compared to existing techniques.
Efficient approximation of value functions is achieved even with large state spaces.
Subproblem decomposition leads to manageable computational loads.

Conclusions:

The novel approach offers a computationally efficient solution for value function approximation in reinforcement learning.
This method addresses the limitations of prior knowledge requirements and high computational costs of existing techniques.
The decomposition strategy proves effective in optimizing reinforcement learning computations.