Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Distributed Loads: Problem Solving

Distributed Loads: Problem Solving

Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Cognitive Learning

Cognitive Learning

Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...

Neural Regulation

Neural Regulation

Digestion begins with a cephalic phase that prepares the digestive system to receive food. When our brain processes visual or olfactory information about food, it triggers impulses in the cranial nerves innervating the salivary glands and stomach to prepare for food.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Inverse lithography source optimization via compressive sensing.

Optics express·2014

Same author

[Determination of fatty acid esters of chloropropanediols in diet samples by gas chromatography-mass spectrometry coupled with solid-supported liquid-liquid extraction].

Wei sheng yan jiu = Journal of hygiene research·2014

Same author

[The reason for emulsification and method improvement for vitamin E detection in fish oil health food].

Wei sheng yan jiu = Journal of hygiene research·2014

Same author

A novel small deletion mutation in RUNX2 gene in one Chinese family with cleidocranial dysplasia.

International journal of clinical and experimental pathology·2014

Same author

Evaluating the impact of environmental temperature on global highly pathogenic avian influenza (HPAI) H5N1 outbreaks in domestic poultry.

International journal of environmental research and public health·2014

Same author

<i>In vivo</i> determination of muscle-derived stem cells in the rat corpus cavernosum.

Experimental and therapeutic medicine·2014

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Videos

Cloud-Edge Resource Scheduling and Offloading Optimization Based on Deep Reinforcement Learning.

Lili Yin¹, Yunze Xie¹, Ze Zhao¹

¹School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China.

Sensors (Basel, Switzerland)

|March 14, 2026

Summary

This summary is machine-generated.

This study introduces a deep reinforcement learning algorithm for smart manufacturing, significantly reducing task dropouts and latency in Industrial Internet of Things (IoT) environments. The method effectively manages dynamic edge node loads for real-time processing.

Keywords:

Deep Q-Networks convolutional neural networks deep reinforcement learning informer resource scheduling task offloading

Related Experiment Videos

Area of Science:

Smart Manufacturing
Industrial Internet of Things (IoT)
Edge Computing

Background:

Smart manufacturing relies on Industrial Internet of Things (IoT) devices, generating numerous latency-sensitive tasks requiring real-time processing.
Dynamic changes in edge node load cause increased latency and task dropouts, posing challenges for cloud-edge-end collaboration.
Existing task offloading strategies struggle with unknown edge node loads and dynamic system states.

Purpose of the Study:

To propose a distributed algorithm for effective task offloading in smart manufacturing environments.
To address the challenges of unknown edge node loads and dynamic system state changes.
To optimize task allocation and execution order for latency-sensitive tasks.

Main Methods:

A distributed algorithm based on deep reinforcement learning, incorporating Convolutional Neural Networks (CNN) and the Informer architecture.
CNN extracts local features of edge node loads; Informer's self-attention captures long-term load trends.
Integration of Dueling Deep Q-Network (DQN) and Double DQN for precise state-action value function approximation.

Main Results:

The proposed algorithm reduces task dropout rates by 82.3-94%.
Average latency is decreased by 28-39.2% compared to existing algorithms.
The method demonstrates significant advantages in high-load, latency-sensitive manufacturing scenarios.

Conclusions:

The developed deep reinforcement learning algorithm effectively handles dynamic edge node loads and system uncertainties.
Independent task offloading decisions by mobile devices enable dynamic task allocation and optimized execution.
The algorithm offers a robust solution for real-time processing in smart manufacturing with Industrial IoT.