Search research articles

Related Concept Videos

Machines: Problem Solving II

Machines: Problem Solving II

Machines are complex structures consisting of movable, pin-connected multi-force members that work together to transmit forces. Consider a lifting tong carrying a 100 kg load. It comprises movable sections DAF and CBG linked together with member AB.

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Machines: Problem Solving I

Machines: Problem Solving I

A toggle clamp is a mechanical device commonly used for holding and clamping objects in various applications, such as woodworking, metalworking, and assembly operations. Consider a toggle clamp subjected to a force of 200 N at the handle. The vertical clamping force can be calculated, provided the dimensions of the toggle clamp are known.
The toggle clamp system is a machine structure consisting of movable, pin-connected multi-force members that form a stabilized system to transmit forces. The...

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

Electrocyclic reactions, cycloadditions, and sigmatropic rearrangements are concerted pericyclic reactions that proceed via a cyclic transition state. These reactions are stereospecific and regioselective. The stereochemistry of the products depends on the symmetry characteristics of the interacting orbitals and the reaction conditions. Accordingly, pericyclic reactions are classified as either symmetry-allowed or symmetry-forbidden. Woodward and Hoffmann presented the selection criteria for...

Social Loafing

Another way in which a group presence can affect performance is social loafing—the exertion of less effort by a person working together with a group. Social loafing occurs when our individual performance cannot be evaluated separately from the group. Thus, group performance declines on easy tasks (Karau & Williams, 1993). Essentially individual group members loaf and let other group members pick up the slack. Because each individual’s efforts cannot be evaluated,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

BaleUAVision: Hay Bales UAV Captured Dataset.

Scientific data·2026

Same author

Agentic LLM-based robotic systems for real-world applications: a review on their agenticness and ethics.

Frontiers in robotics and AI·2025

Same author

Delivering data: A real-world dataset for last-mile delivery optimization.

Data in brief·2025

Same author

Explainable Siamese Neural Networks for Detection of High Fall Risk Older Adults in the Community Based on Gait Analysis.

Journal of functional morphology and kinesiology·2025

Same author

On complexity of colloid cellular automata.

Scientific reports·2024

Same author

A Biologically Inspired Movement Recognition System with Spiking Neural Networks for Ambient Assisted Living Applications.

Biomimetics (Basel, Switzerland)·2024

Same journal

Passive wheels on legged robots: a survey.

Frontiers in robotics and AI·2026

Same journal

Politeness cannot make up for robots' errors.

Frontiers in robotics and AI·2026

Same journal

Workers expect basic social skills but limited autonomy from future robots - a qualitative interview study and taxonomy for robot social skills.

Frontiers in robotics and AI·2026

Same journal

Human-robot interaction in sustainable hospitality: how robot type shapes customer emotions, green perceptions, and service loyalty.

Frontiers in robotics and AI·2026

Same journal

Dynamic variance-aware federated tuning for efficient autonomous vehicle perception under non-IID settings.

Frontiers in robotics and AI·2026

Same journal

Editorial: Synergizing large language models and computational intelligence for advanced robotic systems.

Frontiers in robotics and AI·2026

See all related articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Video

Updated: Jul 6, 2025

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition

Published on: July 13, 2019

Decomposing user-defined tasks in a reinforcement learning setup using TextWorld.

Thanos Petsanis¹, Christoforos Keroglou¹, Athanasios Ch Kapoutsis²

¹School of Engineering, Department of Electrical and Computer Engineering, Democritus University of Thrace (DUTH), Xanthi, Greece.

Frontiers in Robotics and AI

|January 8, 2024

Summary

This summary is machine-generated.

This study introduces hierarchical reinforcement learning (HRL) to simplify complex tasks for autonomous agents. This approach enhances agent training by disentangling actions and enabling dense rewards, improving overall performance.

Keywords:

autonomous agents formal methods in robotics and automation hierarchical reinforcement learning reinforcement learning task and motion planning

More Related Videos

A Real-world What-Where-When Memory Test

A Real-world What-Where-When Memory Test

Published on: May 16, 2017

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Related Experiment Videos

Last Updated: Jul 6, 2025

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition

Published on: July 13, 2019

A Real-world What-Where-When Memory Test

A Real-world What-Where-When Memory Test

Published on: May 16, 2017

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Area of Science:

Artificial Intelligence
Machine Learning
Robotics

Background:

Complex tasks pose significant challenges for training autonomous agents.
Sparse reward functions in simulated environments often hinder effective learning.
Hierarchical Reinforcement Learning (HRL) offers a potential solution for task decomposition.

Purpose of the Study:

To propose and evaluate a novel hierarchical reinforcement learning (HRL) method for autonomous agent training.
To demonstrate the benefits of task decomposition in improving agent learning efficiency.
To leverage high-level abstractions for enhanced reward function design.

Main Methods:

Implementation of a hierarchical reinforcement learning (HRL) framework using TextWorld and MiniGrid Python environments.
Utilizing MiniGrid for 2D environment simulation and TextWorld for high-level task abstraction.
Designing a dense reward function for the lower-level environment based on task abstraction.
Employing formal methods to verify the solution-finding capabilities of the proposed algorithm.

Main Results:

The proposed HRL method successfully decomposes complex tasks into manageable sub-tasks.
Training on the TextWorld abstraction disentangles manipulation and navigation, simplifying the learning process.
The use of a dense reward function significantly improves agent training performance compared to sparse rewards.
Formal methods confirmed that the algorithm is capable of deriving solutions.

Conclusions:

Hierarchical reinforcement learning (HRL) with task abstraction provides an effective strategy for training autonomous agents.
The integration of TextWorld and MiniGrid facilitates a practical and efficient implementation of HRL.
Disentangling actions and employing dense rewards are key factors in enhancing agent performance in simulated environments.