


Updated: Sep 17, 2025


Influence-aware memory architectures for deep reinforcement learning in POMDPs.

Miguel Suau1, Jinke He1, Elena Congeduti1

  • 1Intelligent Systems, Delft University of Technology, Delft, The Netherlands.

Neural Computing & Applications | June 30, 2025
PubMed
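Summary
This summary is machine-generated.

This study introduces influence-aware memory, a memory architecture for deep reinforcement learning agents. By focusing the recurrent layers on the variables that influence hidden state, it improves training speed and performance and overcomes limitations of standard recurrent neural networks (RNNs).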

Keywords:
Conditional independence; Influence; Partial observability; Recurrent neural networks; Reinforcement learning


Scientific fields:

  • Artificial intelligence
  • Machine learning
  • Robotics

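Background:

  • Agents often have limited perception, so the information they receive about the environment is insufficient for optimal decision-making.
  • Recurrent neural networks (RNNs) are used in deep reinforcement learning to remember past observations, but they are hard to train and slow to converge on high-dimensional data.
  • Partial observability calls for effective methods that let agents infer hidden state information from their action-observation history.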
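
As a point of reference for the last bullet, the textbook way to infer hidden state from an action-observation history in a small discrete POMDP is a Bayesian belief update. The sketch below is not taken from the paper: the function names and the two-state transition and observation matrices are invented for illustration. Deep reinforcement learning agents typically replace this exact filter with a learned memory such as an RNN, because the model is unknown or too large.

```python
import numpy as np

def belief_update(belief, action, observation, T, O):
    """One step of the Bayes filter for a discrete POMDP.

    T[a, s, s2] = P(s2 | s, a) and O[a, s2, o] = P(o | s2, a).
    """
    predicted = belief @ T[action]  # prediction step: sum over s of b(s) * P(s2 | s, a)
    unnormalized = predicted * O[action][:, observation]  # weight by observation likelihood
    total = unnormalized.sum()
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return unnormalized / total

def filter_history(history, T, O):
    """Fold a list of (action, observation) pairs into a belief, starting from uniform."""
    n_states = T.shape[1]
    belief = np.full(n_states, 1.0 / n_states)
    for action, observation in history:
        belief = belief_update(belief, action, observation, T, O)
    return belief

# Hypothetical model: one action, two hidden states, two observations.
T = np.array([[[0.9, 0.1],
               [0.2, 0.8]]])
O = np.array([[[0.8, 0.2],
               [0.3, 0.7]]])

belief = filter_history([(0, 0), (0, 0)], T, O)
```

Each observation of symbol 0 shifts the belief toward state 0, which is the kind of hidden-state inference an agent's memory has to approximate when it only sees the history.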

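Objectives of the study:

  • Propose a new memory architecture, influence-aware memory, that addresses the training difficulties and performance limits of standard RNNs in deep reinforcement learning.
  • Improve the agent's ability to uncover hidden state information despite its perceptual limitations.
  • Improve training speed, policy performance, and runtime efficiency relative to existing methods.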

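Main methods:

  • Developed an influence-aware memory architecture that restricts the input of the recurrent layers to the variables that influence hidden state information.
  • Integrated a feedforward neural network to process the observation variables that do not influence the hidden state.
  • Allowed information to flow to the output without necessarily being stored in the RNN's internal memory, unlike the feedback mechanism of standard RNNs.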
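
The three points above can be sketched as a two-branch forward pass. This is a minimal illustration under stated assumptions, not the authors' implementation: the class name, layer sizes, random weights, and the plain tanh recurrent cell are all invented here, and the indices of the influential variables are passed in by hand because this summary does not say how they are identified.

```python
import numpy as np

class InfluenceAwareMemorySketch:
    """Two-branch network: a feedforward branch over the full observation and
    a recurrent branch that only sees a chosen subset of observation variables."""

    def __init__(self, obs_dim, influence_idx, ff_dim, rnn_dim, n_actions, seed=0):
        rng = np.random.default_rng(seed)
        self.influence_idx = np.asarray(influence_idx)  # hypothetical, hand-picked
        d = len(self.influence_idx)
        self.rnn_dim = rnn_dim
        self.W_ff = rng.normal(0.0, 0.1, (obs_dim, ff_dim))    # feedforward branch
        self.W_in = rng.normal(0.0, 0.1, (d, rnn_dim))         # recurrent input weights
        self.W_h = rng.normal(0.0, 0.1, (rnn_dim, rnn_dim))    # recurrent state weights
        self.W_out = rng.normal(0.0, 0.1, (ff_dim + rnn_dim, n_actions))

    def initial_state(self):
        return np.zeros(self.rnn_dim)

    def step(self, obs, h):
        # Feedforward features reach the output but are never fed back,
        # so they do not have to be stored in the recurrent state.
        ff = np.maximum(0.0, obs @ self.W_ff)
        # The recurrent state is updated only from the selected variables.
        h_next = np.tanh(obs[self.influence_idx] @ self.W_in + h @ self.W_h)
        # Action values are computed from both branches.
        q_values = np.concatenate([ff, h_next]) @ self.W_out
        return q_values, h_next

net = InfluenceAwareMemorySketch(obs_dim=6, influence_idx=[0, 1],
                                 ff_dim=8, rnn_dim=4, n_actions=3)
h = net.initial_state()
for obs in np.ones((5, 6)):  # five dummy observations
    q_values, h = net.step(obs, h)
```

Because the recurrent branch reads only `obs[influence_idx]`, changing any other observation variable leaves the memory state untouched and can affect the output only through the feedforward branch.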

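Main results:

  • The influence-aware memory architecture clearly outperformed standard recurrent architectures in training speed and policy performance.
  • The proposed method ran faster than conventional approaches.
  • It achieved better performance scores than methods that stack multiple observations to mitigate partial observability.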

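Conclusions:

  • Influence-aware memory offers a theoretically inspired and effective way to handle partial observability in deep reinforcement learning.
  • By focusing the recurrent layers on the key variables, the method improves learning efficiency and agent performance.
  • The architecture is a promising direction for building more capable and more efficient intelligent agents in complex environments.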