Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Distributed Loads: Problem Solving

Distributed Loads: Problem Solving

Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Cognitive Learning

Cognitive Learning

Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...

Neural Regulation

Neural Regulation

Digestion begins with a cephalic phase that prepares the digestive system to receive food. When our brain processes visual or olfactory information about food, it triggers impulses in the cranial nerves innervating the salivary glands and stomach to prepare for food.

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Inverse lithography source optimization via compressive sensing.

Optics express·2014

Same author

[Determination of fatty acid esters of chloropropanediols in diet samples by gas chromatography-mass spectrometry coupled with solid-supported liquid-liquid extraction].

Wei sheng yan jiu = Journal of hygiene research·2014

Same author

[The reason for emulsification and method improvement for vitamin E detection in fish oil health food].

Wei sheng yan jiu = Journal of hygiene research·2014

Same author

A novel small deletion mutation in RUNX2 gene in one Chinese family with cleidocranial dysplasia.

International journal of clinical and experimental pathology·2014

Same author

Evaluating the impact of environmental temperature on global highly pathogenic avian influenza (HPAI) H5N1 outbreaks in domestic poultry.

International journal of environmental research and public health·2014

Same author

<i>In vivo</i> determination of muscle-derived stem cells in the rat corpus cavernosum.

Experimental and therapeutic medicine·2014

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

查看所有相关文章

Search research articles

相关实验视频

基于深度强化学习学习的云端资源调度和卸载优化.

Lili Yin¹, Yunze Xie¹, Ze Zhao¹

¹School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China.

Sensors (Basel, Switzerland)

|March 14, 2026

概括

此摘要是机器生成的。

本研究引入了用于智能制造的深度强化学习算法,显著减少工业物联网 (IoT) 环境中的任务中断和延迟. 该方法有效地管理动态边缘节点负载,用于实时处理.

关键词:

深度Q-网络是什么卷积神经网络是一种卷积神经网络.深度强化学习的学习.举报人举报人举报人举报人资源规划资源规划任务卸载任务卸载

相关实验视频

科学领域:

智能制造智能制造是一种智能制造.
物联网 (IoT) 的工业互联网.
边缘计算边缘计算

背景情况:

智能制造依赖于工业物联网 (IoT) 设备,产生许多需要实时处理的延迟敏感任务.
边缘节点负载的动态变化导致延迟增加和任务中断,这给云端边缘端协作带来了挑战.
现有的任务卸载策略与未知的边缘节点负载和动态系统状态作斗争.

研究的目的:

提出一种分布式算法,用于在智能制造环境中有效卸载任务.
为了应对未知的边缘节点负载和动态系统状态变化的挑战.
优化对延迟敏感任务的任务分配和执行顺序.

主要方法:

基于深度强化学习的分布式算法,结合了卷积神经网络 (CNN) 和Informer架构.
CNN提取边缘节点负载的局部特征;Informer的自我注意力捕捉了长期负载趋势.
集成决斗深度Q网络 (DQN) 和双DQN,用于精确的状态动作值函数近似.

主要成果:

拟议的算法可以将任务中断率降低82.3-94%.
与现有算法相比,平均延迟时间减少了28-39.2%.
该方法在高负载,延迟敏感的制造场景中显示出显著的优势.

结论:

开发的深度强化学习算法有效地处理动态边缘节点负载和系统不确定性.
移动设备的独立任务卸载决策使动态任务分配和优化执行成为可能.
该算法提供了一个强大的解决方案,用于实时处理与工业物联网的智能制造.