Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

931
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
931
Higher Mental Functions of Brain: Learning and Memory01:26

Higher Mental Functions of Brain: Learning and Memory

2.1K
Memory is one of the most vital higher mental functions of the brain. Memory is closely related to learning because it enables us to retain information and experiences from our past to use them in our present life. It also helps us to remember facts, events, and skills, such as riding a bike or swimming. There are two types of memory — declarative memory, which involves memorizing facts or events, and procedural memory, which enables us to remember how to do something like writing or...
2.1K
Reinforcements in Concrete01:25

Reinforcements in Concrete

475
Reinforced concrete is a composite material used extensively in construction, combining the compressive strength of concrete with the tensile strength of steel. This synergy is essential as concrete, while excellent at resisting compression, is weak under tension. Steel bars, or rebars, are embedded in the concrete to handle these tensile forces. The choice of steel is strategic; it shares a similar coefficient of thermal expansion with concrete, which ensures uniformity in response to...
475
Corrosion of Reinforcement01:27

Corrosion of Reinforcement

584
The corrosion of steel reinforcement within concrete is a process influenced by the material's inherent properties and external factors. The high pH level of around 13, provided by calcium hydroxide present in concrete, initially protects the steel reinforcement by promoting the formation of a passive iron oxide layer on its surface.
However, over time and under certain conditions like carbonation, chloride ingress, and cracking this protective state can be compromised. Steel has areas with...
584
Reinforcement Schedules01:24

Reinforcement Schedules

509
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
509
What are Estimates?01:06

What are Estimates?

8.8K
It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates. 
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...
8.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Template-Based Label Propagation for Mouse Brain MRI Skull Stripping.

Neuroinformatics·2026
Same author

CRTC1 knockdown in the marmoset visual cortex induces neuronal IEG overexpression, HFOs, and neurodegeneration.

Neuroscience research·2026
Same author

Brain/MINDS Marmoset Brain Atlas 2.0: Population Cortical Parcellation With Multi-Modal Templates.

Scientific data·2026
Same author

Blaming luck, claiming skill: Self-attribution bias in error assignment.

PLoS computational biology·2025
Same author

Decoding Confidence in Future Event: EEG Markers of Prospective Confidence in Perceptual and Memory Tasks.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Neural mechanisms of individual differences in prior weight during scene recognition.

NeuroImage·2025
Same journal

Strategic Ability Updating in Concurrent Games by Coalitional Commitment.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2015
Same journal

Meta-Analysis of the First Facial Expression Recognition Challenge.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

Adjustable model-based fusion method for multispectral and panchromatic images.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

Face Feature Weighted Fusion Based on Fuzzy Membership Degree for Video Face Recognition.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

A New Adaptive Fast Cellular Automaton Neighborhood Detection and Rule Identification Algorithm.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

Human-arm-and-hand-dynamic model with variability analyses for a stylus-based haptic interface.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
See all related articles

Related Experiment Video

Updated: Feb 7, 2026

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces
06:14

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces

Published on: September 11, 2018

7.0K

Incremental state aggregation for value function estimation in reinforcement learning.

Takeshi Mori1, Shin Ishii

  • 1Institute of Perception, Action and Behaviour, School of Informatics, The University of Edinburgh, Edinburgh, UK. takeshi.mori@ed.ac.uk

IEEE Transactions on Systems, Man, and Cybernetics. Part B, Cybernetics : a Publication of the IEEE Systems, Man, and Cybernetics Society
|June 3, 2011
PubMed
Summary
This summary is machine-generated.

This study introduces an efficient reinforcement learning method for estimating value functions in large state spaces. By decomposing problems into smaller parts, it significantly reduces computational cost compared to existing techniques.

More Related Videos

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.2K
Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring
08:47

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Published on: November 12, 2008

11.7K

Related Experiment Videos

Last Updated: Feb 7, 2026

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces
06:14

Multiscale Structures Aggregated by Imprinted Nanofibers for Functional Surfaces

Published on: September 11, 2018

7.0K
Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.2K
Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring
08:47

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Published on: November 12, 2008

11.7K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Reinforcement Learning

Background:

  • Estimating value functions in reinforcement learning is challenging due to large state and action spaces.
  • Basis functions are commonly used but require significant prior knowledge for preparation.
  • Existing adaptive basis function methods are computationally expensive.

Purpose of the Study:

  • To propose an efficient method for approximating value functions in reinforcement learning.
  • To overcome the computational cost associated with adaptive basis function construction.
  • To reduce the difficulty of preparing basis functions in reinforcement learning.

Main Methods:

  • Decomposing the value function approximation problem into smaller subproblems.
  • Solving each subproblem with reduced computational cost.
  • Utilizing a linear combination of basis functions for value function representation.

Main Results:

  • The proposed method significantly reduces CPU time compared to existing techniques.
  • Efficient approximation of value functions is achieved even with large state spaces.
  • Subproblem decomposition leads to manageable computational loads.

Conclusions:

  • The novel approach offers a computationally efficient solution for value function approximation in reinforcement learning.
  • This method addresses the limitations of prior knowledge requirements and high computational costs of existing techniques.
  • The decomposition strategy proves effective in optimizing reinforcement learning computations.