Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Hierarchy of Motor Control01:18

Hierarchy of Motor Control

6.6K
The hierarchy of motor control refers to the different levels of organization and processing involved in controlling movement in the body. These levels range from higher cortical areas involved in planning and decision-making to lower spinal cord reflexes that respond automatically to external stimuli.
6.6K
Observational Learning01:12

Observational Learning

1.1K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.1K
Stereotype Content Model02:16

Stereotype Content Model

15.6K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
15.6K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

380
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
380
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

455
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
455

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Machine learning high-throughput screening of rare earth SACs with different coordination environments for the HER.

Chemical communications (Cambridge, England)·2025
Same author

ProAttUnet: Advancing protein secondary structure prediction with deep learning via U-Net dual-pathway feature fusion and ESM2 pretrained protein language model.

Computational biology and chemistry·2025
Same author

Reinforced Metapath Optimization in Heterogeneous Information Networks for Drug-Target Interaction Prediction.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same author

Effect of Different N/C Coordination Electronic Structures on the Activity of Bifunctional Rare-Earth Ytterbium Electrocatalysts for Oxygen Electrodes.

Langmuir : the ACS journal of surfaces and colloids·2024
Same author

Co/Ce-MOF-Derived Oxygen Electrode Bifunctional Catalyst for Rechargeable Zinc-Air Batteries.

Inorganic chemistry·2024
Same author

Machine Learning-Assisted Study of REN<sub></sub>C<sub>6-</sub>-Doped Graphene as Potential Electrocatalysts for Oxygen Electrode Reactions.

Langmuir : the ACS journal of surfaces and colloids·2024
Same journal

RETRACTION: Real-Time Modulation of Physical Training Intensity Based on Wavelet Recursive Fuzzy Neural Networks.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Multidimensional Heterogeneous Network Link Adaptation Based on Mobile Environment.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Framework to Segment and Evaluate Multiple Sclerosis Lesion in MRI Slices Using VGG-UNet.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Facial Emotion Recognition Using a Novel Fusion of Convolutional Neural Network and Local Binary Pattern in Crime Investigation.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Automatic Intelligent System Using Medical of Things for Multiple Sclerosis Detection.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Intangible Cultural Heritage Reproduction and Revitalization: Value Feedback, Practice, and Exploration Based on the IPA Model.

Computational intelligence and neuroscience·2026
See all related articles

Related Experiment Videos

Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.

Shan Zhong1, Quan Liu2, QiMing Fu3

  • 1School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215000, China; School of Computer Science and Engineering, Changshu Institute of Technology, Changshu, Jiangsu 215500, China.

Computational Intelligence and Neuroscience
|November 1, 2016
PubMed
Summary
This summary is machine-generated.

Two new methods, actor-critic hierarchical model learning and planning (AC-HMLP) and its regularized version (RAC-HMLP), enhance reinforcement learning convergence and sample efficiency. These methods effectively combine local and global information for superior performance.

Related Experiment Videos

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Reinforcement Learning

Background:

  • Reinforcement Learning (RL) algorithms often face challenges with slow convergence and low sample efficiency.
  • Hierarchical approaches can improve RL performance by structuring learning and planning processes.

Purpose of the Study:

  • To propose and evaluate two novel efficient learning methods, AC-HMLP and RAC-HMLP, for enhancing convergence rate and sample efficiency in RL.
  • To leverage hierarchical models for improved information utilization in RL algorithms.

Main Methods:

  • Developed AC-HMLP and RAC-HMLP by integrating actor-critic algorithms with hierarchical model learning and planning.
  • Employed local linear regression (LLR) for local models and linear function approximation (LFA) for global models within a hierarchical structure.
  • Utilized both local and global models for sample generation during planning, with conditional application of the local model based on state-prediction error.

Main Results:

  • AC-HMLP and RAC-HMLP demonstrated superior performance compared to three representative RL algorithms on benchmark problems.
  • The proposed methods achieved significant improvements in both convergence rate and sample efficiency.
  • The integration of local and global models effectively utilized information for accelerated learning.

Conclusions:

  • AC-HMLP and RAC-HMLP represent effective advancements in RL, offering improved convergence and sample efficiency.
  • Hierarchical model learning and planning, combined with actor-critic methods, provide a robust framework for complex RL tasks.