Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Cognitive Learning

Cognitive Learning

Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...

Introduction to Learning

Introduction to Learning

Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...

Purposive Learning

Purposive Learning

E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...

Law of Effect

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Deep-Learning-Enhanced Bioimaging Via Energy Traps Regulated Lanthanide Nanoparticles.

Angewandte Chemie (International ed. in English)·2026

Same author

Nationwide occurrence and site-scale dynamics of antibiotics in landfill-associated waters in China.

Water research·2026

Same author

Effects of interlayer cations and hydration on PFAS adsorption and mobility in montmorillonite: A molecular dynamics study.

Journal of contaminant hydrology·2026

Same author

Soft tactile chip with in-situ sensing for haptic rendering and reverse feedback enhanced gross to fine teleoperation.

Nature communications·2026

Same author

Efficient event-based object detection with lightweight CNNs: a surprising advantage over transformers.

Applied optics·2026

Same author

Preterm Birth International Collaborative Australasia Branch: Expert Consensus on Diagnosis and Treatment of Neonatal Lactose Intolerance (2025).

Pediatric discovery·2026

Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026

Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026

Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026

Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026

Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026

Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Human skill knowledge guided global trajectory policy reinforcement learning method.

Yajing Zang¹, Pengfei Wang¹, Fusheng Zha¹

¹State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin, China.

Frontiers in Neurorobotics

|April 1, 2024

Summary

This summary is machine-generated.

This study introduces a novel global trajectory learning method combining Imitation Learning (IL) and Reinforcement Learning (RL). This approach enhances adaptability to new environments, improving robot trajectory control.

Keywords:

behavioral cloning imitation learning path planning probabilistic movement primitives reinforcement learning

More Related Videos

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

Related Experiment Videos

Last Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

Area of Science:

Robotics
Machine Learning
Artificial Intelligence

Background:

Traditional Imitation Learning (IL) methods for trajectory learning are limited as they cannot adapt learned policies to new environments through interaction.
Existing methods often fail to fine-tune policies, restricting their applicability in dynamic or novel task settings.

Purpose of the Study:

To propose a novel global trajectory learning method that integrates Imitation Learning (IL) with Reinforcement Learning (RL).
To enable trajectory knowledge adaptation to specific task environments through interaction and policy fine-tuning.
To develop a more robust and adaptable trajectory learning system for robotic applications.

Main Methods:

Utilized IL to acquire foundational trajectory skills, including knowledge policy and temporal information, to guide the learning process.
Employed Reinforcement Learning (RL) for policy exploration and exploitation within the target environment, without using neural networks for action or Q-value modeling during RL.
Sampled and updated policies in the task space, transferring them to neural networks post-RL via Behavior Cloning (BC) for a smooth, continuous global trajectory policy.

Main Results:

Successfully validated the feasibility and effectiveness of the proposed method in a simulated flower drawing task within a custom Gym environment.
Demonstrated the ability to generate continuous and smooth global trajectory policies.
Executed the learned policy on a real-world robot, confirming its practical applicability.

Conclusions:

The combined IL and RL approach offers a significant advancement over traditional IL methods by enabling adaptive trajectory learning.
The method's success in both simulation and real-world experiments highlights its potential for complex robotic control tasks.
This approach provides a robust framework for learning and adapting complex trajectories in robotics.