Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Observational Learning01:12

Observational Learning

804
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
804
Graphs of Equations in Two Variables01:30

Graphs of Equations in Two Variables

180
An equation with two variables, typically written in the form y = f(x) or Ax + By = C, describes a relationship between quantities represented by x and y. Each solution to such an equation is an ordered pair (x, y) that satisfies the equation when substituted. These pairs can be represented graphically to understand the variables' relationship visually.A common technique for constructing the graph of a two-variable equation is to create a value table. Begin by choosing several values for the...
180
Reinforcement01:23

Reinforcement

804
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
804
Graphical Representation of Inequalities01:28

Graphical Representation of Inequalities

157
The graph of the equation where y equals x squared forms a curve known as a parabola. This curve acts as a boundary in the coordinate plane, dividing it into distinct regions based on the relative position of points.When the equality sign in the equation is replaced with an inequality—such as greater than, less than, greater than or equal to, or less than or equal to—the graphical representation changes from a single curve into a broader shaded area that signifies the set of all...
157
Associative Learning01:27

Associative Learning

1.2K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
1.2K
Reinforcement Schedules01:24

Reinforcement Schedules

440
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
440

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Peripheral and central vestibular neuromodulation improve postural control in adolescent idiopathic scoliosis: a randomized, sham-controlled, multi-arm intervention study.

Journal of neuroengineering and rehabilitation·2026
Same author

Chemically Fueled Interfacial Supramolecular Polymerization.

ACS nano·2026
Same author

scCCVGBen for benchmarking of single-cell representation learning anchored on a centroid-coupled variational graph attention autoencoder across scRNA-seq and scATAC-seq.

Frontiers in genetics·2026
Same author

Reduced HAV IgG Seropositivity Among Unvaccinated People Living with HIV: The Weak Shield.

Tropical medicine and infectious disease·2026
Same author

Immunosuppression, resistance burden, and qSOFA on short-term prognosis and difficult clearance in hospitalized patients with Salmonella infection: a single-center retrospective cohort study.

BMC infectious diseases·2026
Same author

LAIOR: a hyperbolic neural ODE variational framework for interpretable single-cell manifold learning and trajectory inference.

Frontiers in genetics·2026
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026
Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
查看所有相关文章

相关实验视频

Updated: Jan 11, 2026

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.3K

GCM:通过图形合作建模进行可解释的多代理增强学习.

Xuefei Wu, Yuanyang Zhu, Caihua Chen

    IEEE transactions on neural networks and learning systems
    |November 11, 2025
    PubMed
    概括
    此摘要是机器生成的。

    图形合作建模 (GCM) 通过使用图形结构进行透明的决策来增强多代理强化学习 (MARL). 这种方法提高了业绩,并提供了对代理合作模式的见解.

    更多相关视频

    Constructing and Visualizing Models using Mime-based Machine-learning Framework
    06:19

    Constructing and Visualizing Models using Mime-based Machine-learning Framework

    Published on: July 22, 2025

    2.3K

    相关实验视频

    Last Updated: Jan 11, 2026

    Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
    05:47

    Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

    Published on: June 13, 2025

    1.3K
    Constructing and Visualizing Models using Mime-based Machine-learning Framework
    06:19

    Constructing and Visualizing Models using Mime-based Machine-learning Framework

    Published on: July 22, 2025

    2.3K

    科学领域:

    • 人工智能的人工智能
    • 机器学习 机器学习
    • 多代理系统 多代理系统

    背景情况:

    • 多代理强化学习 (MARL) 面临着不透明神经网络决策的挑战.
    • 缺乏透明度阻碍了人们对MARL模型的理解和信任.
    • 数据固有的拓结构为MARL.提供了透明度的潜力.

    研究的目的:

    • 为透明的 MARL 引入图形合作建模 (GCM).
    • 使用图形结构捕捉和解释代理人之间复杂的协作动态.
    • 加强代理人信用分配,并专注于与任务相关的信息.

    主要方法:

    • 开发了使用图形结构来建模代理相互作用的GCM.
    • 集成了一个学习的度量函数来识别受益代理合作.
    • 使用图形神经网络 (GNN) 进行任意顺序交互建模.
    • 利用身份语义,全球状态和个人价值函数用于代理信用估计.

    主要成果:

    • 在具有挑战性的MARL基准指标上,GCM实现了高达28.75%的相对绩效增长.
    • 在超硬地图上显示了显著的改进.
    • 提供了对代理人之间潜在的合作模式的明确解释性.

    结论:

    • GCM提供了一种透明和有效的MARL.方法.
    • 这种基于图表的方法在多代理系统中提高了性能和可解释性.
    • 在复杂的MARL任务中,GCM有助于更深入地了解合作动态.