Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Hierarchy of Motor Control

Hierarchy of Motor Control

The hierarchy of motor control refers to the different levels of organization and processing involved in controlling movement in the body. These levels range from higher cortical areas involved in planning and decision-making to lower spinal cord reflexes that respond automatically to external stimuli.

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Stereotype Content Model

Stereotype Content Model

The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Machine learning high-throughput screening of rare earth SACs with different coordination environments for the HER.

Chemical communications (Cambridge, England)·2025

Same author

ProAttUnet: Advancing protein secondary structure prediction with deep learning via U-Net dual-pathway feature fusion and ESM2 pretrained protein language model.

Computational biology and chemistry·2025

Same author

Reinforced Metapath Optimization in Heterogeneous Information Networks for Drug-Target Interaction Prediction.

IEEE/ACM transactions on computational biology and bioinformatics·2024

Same author

Effect of Different N/C Coordination Electronic Structures on the Activity of Bifunctional Rare-Earth Ytterbium Electrocatalysts for Oxygen Electrodes.

Langmuir : the ACS journal of surfaces and colloids·2024

Same author

Co/Ce-MOF-Derived Oxygen Electrode Bifunctional Catalyst for Rechargeable Zinc-Air Batteries.

Inorganic chemistry·2024

Same author

Machine Learning-Assisted Study of REN<sub></sub>C<sub>6-</sub>-Doped Graphene as Potential Electrocatalysts for Oxygen Electrode Reactions.

Langmuir : the ACS journal of surfaces and colloids·2024

Same journal

RETRACTION: Real-Time Modulation of Physical Training Intensity Based on Wavelet Recursive Fuzzy Neural Networks.

Computational intelligence and neuroscience·2026

Same journal

RETRACTION: Multidimensional Heterogeneous Network Link Adaptation Based on Mobile Environment.

Computational intelligence and neuroscience·2026

Same journal

RETRACTION: Framework to Segment and Evaluate Multiple Sclerosis Lesion in MRI Slices Using VGG-UNet.

Computational intelligence and neuroscience·2026

Same journal

RETRACTION: Facial Emotion Recognition Using a Novel Fusion of Convolutional Neural Network and Local Binary Pattern in Crime Investigation.

Computational intelligence and neuroscience·2026

Same journal

RETRACTION: Automatic Intelligent System Using Medical of Things for Multiple Sclerosis Detection.

Computational intelligence and neuroscience·2026

Same journal

RETRACTION: Intangible Cultural Heritage Reproduction and Revitalization: Value Feedback, Practice, and Exploration Based on the IPA Model.

Computational intelligence and neuroscience·2026

See all related articles

Search research articles

Related Experiment Videos

Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.

Shan Zhong¹, Quan Liu², QiMing Fu³

¹School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215000, China; School of Computer Science and Engineering, Changshu Institute of Technology, Changshu, Jiangsu 215500, China.

Computational Intelligence and Neuroscience

|November 1, 2016

Summary

This summary is machine-generated.

Two new methods, actor-critic hierarchical model learning and planning (AC-HMLP) and its regularized version (RAC-HMLP), enhance reinforcement learning convergence and sample efficiency. These methods effectively combine local and global information for superior performance.

Related Experiment Videos

Area of Science:

Artificial Intelligence
Machine Learning
Reinforcement Learning

Background:

Reinforcement Learning (RL) algorithms often face challenges with slow convergence and low sample efficiency.
Hierarchical approaches can improve RL performance by structuring learning and planning processes.

Purpose of the Study:

To propose and evaluate two novel efficient learning methods, AC-HMLP and RAC-HMLP, for enhancing convergence rate and sample efficiency in RL.
To leverage hierarchical models for improved information utilization in RL algorithms.

Main Methods:

Developed AC-HMLP and RAC-HMLP by integrating actor-critic algorithms with hierarchical model learning and planning.
Employed local linear regression (LLR) for local models and linear function approximation (LFA) for global models within a hierarchical structure.
Utilized both local and global models for sample generation during planning, with conditional application of the local model based on state-prediction error.

Main Results:

AC-HMLP and RAC-HMLP demonstrated superior performance compared to three representative RL algorithms on benchmark problems.
The proposed methods achieved significant improvements in both convergence rate and sample efficiency.
The integration of local and global models effectively utilized information for accelerated learning.

Conclusions:

AC-HMLP and RAC-HMLP represent effective advancements in RL, offering improved convergence and sample efficiency.
Hierarchical model learning and planning, combined with actor-critic methods, provide a robust framework for complex RL tasks.