Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Multicompartment Models: Overview

Multicompartment Models: Overview

Multicompartment models are mathematical constructs that depict how drugs are distributed and eliminated within the body. They segment the body into several compartments, symbolizing various physiological or anatomical areas connected through drug transfer processes such as absorption, metabolism, distribution, and elimination.
These models offer a more comprehensive representation of drug behavior in the body than one-compartment models. They accommodate the complexity of drug distribution,...

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Propagation of Uncertainty from Systematic Error

Propagation of Uncertainty from Systematic Error

The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Targeting Pyruvate Kinase M2: Signal Transduction Pathways and Exploration of Cancer Therapeutic Strategies.

The AAPS journal·2026

Same author

Natural Products Beyond Inhibition: A Mechanistic Framework Spanning Pockets, Interfaces, and Kinetic Barriers.

Molecules (Basel, Switzerland)·2026

Same author

Unraveling the pleiotropic effects of CCR2-dependent signal transduction in fibrosis development.

Theranostics·2026

Same author

Novel <i>VRK1</i> Variants and a Founder Effect in Axonal Polyneuropathy.

Neurology. Genetics·2026

Same author

From complex algorithms to clinical practice: a multicenter machine learning model and simplified decision tree for predicting cachexia risk in gastric cancer.

Frontiers in oncology·2026

Same author

The p75NTR signaling axis: Bridging neurodevelopmental homeostasis, pathological mechanisms, and therapeutic strategies in neurodegenerative diseases.

Ageing research reviews·2026

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jul 9, 2025

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

一个政策梯度算法来缓解复杂环境中的多代理价值高估问题.

Yang Yang^1,2, Jiang Li^1,2, Jinyong Hou³

¹Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China.

Sensors (Basel, Switzerland)

|December 9, 2023

概括

此摘要是机器生成的。

我们介绍了基于实证集群层的多代理双对决政策梯度 (ECL-MAD3PG) 算法,以改进多代理强化学习. 这种新的方法提高了可靠性和稳定性,在无人机作战模拟中实现了9.1%的任务完成改进.

关键词:

深度决定性的政策梯度渐变.集团决策集团决策对价值函数的高估.经验的播放体验的播放.

更多相关视频

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

The HoneyComb Paradigm for Research on Collective Human Behavior

The HoneyComb Paradigm for Research on Collective Human Behavior

Published on: January 19, 2019

相关实验视频

Last Updated: Jul 9, 2025

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

The HoneyComb Paradigm for Research on Collective Human Behavior

The HoneyComb Paradigm for Research on Collective Human Behavior

Published on: January 19, 2019

科学领域:

人工智能的人工智能
机器学习机器学习
机器人技术机器人技术机器人技术

背景情况:

多代理强化学习 (MARL) 对于复杂,高维环境中的群体决策至关重要.
现有的深度政策梯度方法由于估计错误和经验质量下降而面临可靠性,稳定性和融合方面的挑战.
这些局限性阻碍了在像自主系统这样的苛刻应用中的性能.

研究的目的:

开发一种新的MARL算法,解决当前深度政策梯度方法的局限性.
为了提高决策算法的可靠性,稳定性和融合,在复杂的国家行动空间.
提高经验采样和整体算法性能的效率.

主要方法:

提出基于经验的聚类层的多代理双对决政策梯度 (ECL-MAD3PG) 算法.
整合一个经验集群层来完善体验质量和采样效率.
使用双对决架构来提高价值估计的准确性.

主要成果:

在各种复杂的环境中,ECL-MAD3PG算法表现出卓越的性能.
与多代理深度决定性政策梯度 (MADDPG) 算法相比,在任务完成方面取得了显著的9.1%的改进.
在具有挑战性的场景中展示了增强的可靠性和稳定性,特别是在无人机合作战斗中.

结论:

ECL-MAD3PG有效地克服了传统MARL算法的融合和稳定性问题.
拟议的算法为复杂的,高维度的决策问题提供了强大的解决方案.
在需要可靠和适应性的多代理协调的应用中,ECL-MAD3PG显示出显著的前景.