Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Model-based reinforcement learning for partially observable games with sampling-based state estimation.

Hajime Fujita1, Shin Ishii

  • 1hajime-f@is.naist.jp

Neural Computation
|September 22, 2007
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Template-Based Label Propagation for Mouse Brain MRI Skull Stripping.

Neuroinformatics·2026
Same author

CRTC1 knockdown in the marmoset visual cortex induces neuronal IEG overexpression, HFOs, and neurodegeneration.

Neuroscience research·2026
Same author

Brain/MINDS Marmoset Brain Atlas 2.0: Population Cortical Parcellation With Multi-Modal Templates.

Scientific data·2026
Same author

Blaming luck, claiming skill: Self-attribution bias in error assignment.

PLoS computational biology·2025
Same author

Decoding Confidence in Future Event: EEG Markers of Prospective Confidence in Perceptual and Memory Tasks.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Neural mechanisms of individual differences in prior weight during scene recognition.

NeuroImage·2025
Same journal

A Model-Free Reinforcement Learning Implementation of Decision Making Under Uncertainty by Sequential Sampling.

Neural computation·2026
Same journal

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026
Same journal

Hierarchical Active Inference Using Successor Representations.

Neural computation·2026
Same journal

W-Kernel and Its Principal Space for Frequentist Evaluation of Bayesian Estimators.

Neural computation·2026
Same journal

A Hidden Markov Model-Inspired Sequence Classification Method for Hyperdimensional Computing.

Neural computation·2026
Same journal

Sparse Graphical Modeling for Electrophysiological Phase-Based Connectivity Using Circular Statistics.

Neural computation·2026
See all related articles

This study introduces a model-based reinforcement learning (RL) approach for complex multiagent games with hidden information. The method effectively reduces computational costs, enabling AI agents to learn strategies in challenging environments.

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Game Theory

Background:

  • Multiagent systems present significant reinforcement learning (RL) challenges due to large state spaces and partial observability.
  • Real-world applications like complex games suffer from high computational costs in state estimation and prediction.
  • Effective approximation methods are crucial for enabling agents to learn in unknown, partially observable environments.

Purpose of the Study:

  • To develop a model-based RL scheme for large-scale multiagent problems with partial observability.
  • To address the computational intractability of estimating and predicting in vast, unobservable state spaces.
  • To enable AI agents to learn effective strategies in imperfect information games.

Main Methods:

  • A model-based reinforcement learning (RL) framework was designed for partially observable multiagent problems.

Related Experiment Videos

  • A sampling technique was employed to approximate complex estimations and predictions, reducing computational load.
  • The proposed method was applied to the card game Hearts, a representative imperfect information game.
  • Main Results:

    • The model-based RL scheme demonstrated effectiveness in solving complex, partially observable multiagent problems.
    • The sampling technique significantly reduced the computational cost associated with state estimation and prediction.
    • Simulations confirmed the method's ability to enable learning in challenging game environments.

    Conclusions:

    • The developed model-based RL approach offers a viable solution for strategic learning in large-scale, partially observable multiagent systems.
    • Approximation via sampling is a key technique for managing computational complexity in such domains.
    • The method shows promise for applications beyond card games, including other imperfect information scenarios.