Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Cognitive Learning

Cognitive Learning

Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...

Introduction to Learning

Introduction to Learning

Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...

Purposive Learning

Purposive Learning

E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...

Law of Effect

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Deep-Learning-Enhanced Bioimaging Via Energy Traps Regulated Lanthanide Nanoparticles.

Angewandte Chemie (International ed. in English)·2026

Same author

Nationwide occurrence and site-scale dynamics of antibiotics in landfill-associated waters in China.

Water research·2026

Same author

Effects of interlayer cations and hydration on PFAS adsorption and mobility in montmorillonite: A molecular dynamics study.

Journal of contaminant hydrology·2026

Same author

Soft tactile chip with in-situ sensing for haptic rendering and reverse feedback enhanced gross to fine teleoperation.

Nature communications·2026

Same author

Efficient event-based object detection with lightweight CNNs: a surprising advantage over transformers.

Applied optics·2026

Same author

Preterm Birth International Collaborative Australasia Branch: Expert Consensus on Diagnosis and Treatment of Neonatal Lactose Intolerance (2025).

Pediatric discovery·2026

Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026

Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026

Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026

Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026

Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026

Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

人类技能知识指导全球轨迹政策加强学习方法

Yajing Zang¹, Pengfei Wang¹, Fusheng Zha¹

¹State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin, China.

Frontiers in neurorobotics

|April 1, 2024

概括

此摘要是机器生成的。

本研究介绍了一种新的全球轨迹学习方法,它结合了模仿学习 (IL) 和强化学习 (RL). 这种方法提高了对新环境的适应能力,改善了机器人轨迹的控制.

关键词:

行为克隆行为克隆.模仿学习学习的学习.路径规划路径规划路径规划概率运动原始体的概率运动原始体.强化学习是一种强化学习.

更多相关视频

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

相关实验视频

Last Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

科学领域:

机器人技术机器人技术机器人技术
机器学习机器学习
人工智能的人工智能

背景情况:

传统的模仿学习 (IL) 轨迹学习方法是有限的,因为它们无法通过交互来适应新环境的学习政策.
现有的方法往往无法微调政策,限制它们在动态或新任务设置中的适用性.

研究的目的:

提出一种新的全球轨迹学习方法,将模仿学习 (IL) 与强化学习 (RL) 整合在一起.
通过互动和政策微调,使轨迹知识适应特定的任务环境.
为机器人应用开发一个更强大,更适应的轨迹学习系统.

主要方法:

利用国际知识来获得基础的轨迹技能,包括知识政策和时间信息,以指导学习过程.
使用强化学习 (RL) 在目标环境中进行政策探索和利用,在RL过程中不使用神经网络进行行动或Q值建模.
在任务空间中采样和更新策略,通过行为克隆 (BC) 将它们传输到RL后的神经网络,以实现流,连续的全球轨迹策略.

主要成果:

成功验证了拟议方法的可行性和有效性,在定制健身房环境中的模拟花画任务中.
证明了产生持续和平稳的全球轨迹政策的能力.
在现实世界的机器人上执行了学到的政策,证实了它的实际应用性.

结论:

结合IL和RL方法提供了与传统IL方法相比的显著进步,因为它允许自适应性轨迹学习.
该方法在模拟和现实实验中的成功凸显了它在复杂的机器人控制任务中的潜力.
这种方法为机器人学中的复杂轨迹的学习和适应提供了一个强大的框架.