Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Cognitive Learning01:21

Cognitive Learning

239
Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
239
Introduction to Learning01:18

Introduction to Learning

379
Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...
379
Purposive Learning01:22

Purposive Learning

119
E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...
119
Law of Effect01:06

Law of Effect

1.4K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.4K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Deep-Learning-Enhanced Bioimaging Via Energy Traps Regulated Lanthanide Nanoparticles.

Angewandte Chemie (International ed. in English)·2026
Same author

Nationwide occurrence and site-scale dynamics of antibiotics in landfill-associated waters in China.

Water research·2026
Same author

Effects of interlayer cations and hydration on PFAS adsorption and mobility in montmorillonite: A molecular dynamics study.

Journal of contaminant hydrology·2026
Same author

Soft tactile chip with in-situ sensing for haptic rendering and reverse feedback enhanced gross to fine teleoperation.

Nature communications·2026
Same author

Efficient event-based object detection with lightweight CNNs: a surprising advantage over transformers.

Applied optics·2026
Same author

Preterm Birth International Collaborative Australasia Branch: Expert Consensus on Diagnosis and Treatment of Neonatal Lactose Intolerance (2025).

Pediatric discovery·2026
Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026
Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026
Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026
Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026
查看所有相关文章

相关实验视频

Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K

人类技能知识指导全球轨迹政策加强学习方法

Yajing Zang1, Pengfei Wang1, Fusheng Zha1

  • 1State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin, China.

Frontiers in neurorobotics
|April 1, 2024
PubMed
概括
此摘要是机器生成的。

本研究介绍了一种新的全球轨迹学习方法,它结合了模仿学习 (IL) 和强化学习 (RL). 这种方法提高了对新环境的适应能力,改善了机器人轨迹的控制.

关键词:
行为克隆行为克隆.模仿学习学习的学习.路径规划路径规划路径规划概率运动原始体的概率运动原始体.强化学习是一种强化学习.

更多相关视频

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats
08:59

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

10.4K
Eye Tracking During A Complex Aviation Task For Insights Into Information Processing
07:48

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

253

相关实验视频

Last Updated: Jun 29, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K
Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats
08:59

Acquisition of a High-precision Skilled Forelimb Reaching Task in Rats

Published on: June 22, 2015

10.4K
Eye Tracking During A Complex Aviation Task For Insights Into Information Processing
07:48

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing

Published on: April 4, 2025

253

科学领域:

  • 机器人技术 机器人技术 机器人技术
  • 机器学习 机器学习
  • 人工智能的人工智能

背景情况:

  • 传统的模仿学习 (IL) 轨迹学习方法是有限的,因为它们无法通过交互来适应新环境的学习政策.
  • 现有的方法往往无法微调政策,限制它们在动态或新任务设置中的适用性.

研究的目的:

  • 提出一种新的全球轨迹学习方法,将模仿学习 (IL) 与强化学习 (RL) 整合在一起.
  • 通过互动和政策微调,使轨迹知识适应特定的任务环境.
  • 为机器人应用开发一个更强大,更适应的轨迹学习系统.

主要方法:

  • 利用国际知识来获得基础的轨迹技能,包括知识政策和时间信息,以指导学习过程.
  • 使用强化学习 (RL) 在目标环境中进行政策探索和利用,在RL过程中不使用神经网络进行行动或Q值建模.
  • 在任务空间中采样和更新策略,通过行为克隆 (BC) 将它们传输到RL后的神经网络,以实现流,连续的全球轨迹策略.

主要成果:

  • 成功验证了拟议方法的可行性和有效性,在定制健身房环境中的模拟花画任务中.
  • 证明了产生持续和平稳的全球轨迹政策的能力.
  • 在现实世界的机器人上执行了学到的政策,证实了它的实际应用性.

结论:

  • 结合IL和RL方法提供了与传统IL方法相比的显著进步,因为它允许自适应性轨迹学习.
  • 该方法在模拟和现实实验中的成功凸显了它在复杂的机器人控制任务中的潜力.
  • 这种方法为机器人学中的复杂轨迹的学习和适应提供了一个强大的框架.