Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cognitive Learning01:21

Cognitive Learning

239
Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
239
Introduction to Learning01:18

Introduction to Learning

379
Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...
379
Purposive Learning01:22

Purposive Learning

119
E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...
119
Law of Effect01:06

Law of Effect

1.4K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Deep-Learning-Enhanced Bioimaging Via Energy Traps Regulated Lanthanide Nanoparticles.

Angewandte Chemie (International ed. in English)·2026
Same author

Nationwide occurrence and site-scale dynamics of antibiotics in landfill-associated waters in China.

Water research·2026
Same author

Effects of interlayer cations and hydration on PFAS adsorption and mobility in montmorillonite: A molecular dynamics study.

Journal of contaminant hydrology·2026
Same author

Soft tactile chip with in-situ sensing for haptic rendering and reverse feedback enhanced gross to fine teleoperation.

Nature communications·2026
Same author

Efficient event-based object detection with lightweight CNNs: a surprising advantage over transformers.

Applied optics·2026
Same author

Preterm Birth International Collaborative Australasia Branch: Expert Consensus on Diagnosis and Treatment of Neonatal Lactose Intolerance (2025).

Pediatric discovery·2026
Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026
Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026
Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026
Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026
See all related articles

Related Experiment Video

Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K

Human skill knowledge guided global trajectory policy reinforcement learning method.

Yajing Zang1, Pengfei Wang1, Fusheng Zha1

  • 1State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin, China.

Frontiers in Neurorobotics
|April 1, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a novel global trajectory learning method combining Imitation Learning (IL) and Reinforcement Learning (RL). This approach enhances adaptability to new environments, improving robot trajectory control.

Keywords:
behavioral cloningimitation learningpath planningprobabilistic movement primitivesreinforcement learning

More Related Videos

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats
08:59

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

10.4K
Eye Tracking During A Complex Aviation Task For Insights Into Information Processing
07:48

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

253

Related Experiment Videos

Last Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K
Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats
08:59

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

10.4K
Eye Tracking During A Complex Aviation Task For Insights Into Information Processing
07:48

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

253

Area of Science:

  • Robotics
  • Machine Learning
  • Artificial Intelligence

Background:

  • Traditional Imitation Learning (IL) methods for trajectory learning are limited as they cannot adapt learned policies to new environments through interaction.
  • Existing methods often fail to fine-tune policies, restricting their applicability in dynamic or novel task settings.

Purpose of the Study:

  • To propose a novel global trajectory learning method that integrates Imitation Learning (IL) with Reinforcement Learning (RL).
  • To enable trajectory knowledge adaptation to specific task environments through interaction and policy fine-tuning.
  • To develop a more robust and adaptable trajectory learning system for robotic applications.

Main Methods:

  • Utilized IL to acquire foundational trajectory skills, including knowledge policy and temporal information, to guide the learning process.
  • Employed Reinforcement Learning (RL) for policy exploration and exploitation within the target environment, without using neural networks for action or Q-value modeling during RL.
  • Sampled and updated policies in the task space, transferring them to neural networks post-RL via Behavior Cloning (BC) for a smooth, continuous global trajectory policy.

Main Results:

  • Successfully validated the feasibility and effectiveness of the proposed method in a simulated flower drawing task within a custom Gym environment.
  • Demonstrated the ability to generate continuous and smooth global trajectory policies.
  • Executed the learned policy on a real-world robot, confirming its practical applicability.

Conclusions:

  • The combined IL and RL approach offers a significant advancement over traditional IL methods by enabling adaptive trajectory learning.
  • The method's success in both simulation and real-world experiments highlights its potential for complex robotic control tasks.
  • This approach provides a robust framework for learning and adapting complex trajectories in robotics.