Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

664
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
664
Distributed Loads: Problem Solving01:21

Distributed Loads: Problem Solving

1.2K
Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...
1.2K
Reinforcement01:23

Reinforcement

1.1K
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
1.1K
Observational Learning01:12

Observational Learning

1.2K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.2K
Cognitive Learning01:21

Cognitive Learning

1.5K
Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
1.5K
Neural Regulation01:37

Neural Regulation

44.0K
Digestion begins with a cephalic phase that prepares the digestive system to receive food. When our brain processes visual or olfactory information about food, it triggers impulses in the cranial nerves innervating the salivary glands and stomach to prepare for food.
44.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Inverse lithography source optimization via compressive sensing.

Optics expressยท2014
Same author

[Determination of fatty acid esters of chloropropanediols in diet samples by gas chromatography-mass spectrometry coupled with solid-supported liquid-liquid extraction].

Wei sheng yan jiu = Journal of hygiene researchยท2014
Same author

[The reason for emulsification and method improvement for vitamin E detection in fish oil health food].

Wei sheng yan jiu = Journal of hygiene researchยท2014
Same author

A novel small deletion mutation in RUNX2 gene in one Chinese family with cleidocranial dysplasia.

International journal of clinical and experimental pathologyยท2014
Same author

Evaluating the impact of environmental temperature on global highly pathogenic avian influenza (HPAI) H5N1 outbreaks in domestic poultry.

International journal of environmental research and public healthยท2014
Same author

<i>In vivo</i> determination of muscle-derived stem cells in the rat corpus cavernosum.

Experimental and therapeutic medicineยท2014
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)ยท2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)ยท2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)ยท2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)ยท2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)ยท2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)ยท2026
See all related articles

Related Experiment Videos

Cloud-Edge Resource Scheduling and Offloading Optimization Based on Deep Reinforcement Learning.

Lili Yin1, Yunze Xie1, Ze Zhao1

  • 1School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China.

Sensors (Basel, Switzerland)
|March 14, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a deep reinforcement learning algorithm for smart manufacturing, significantly reducing task dropouts and latency in Industrial Internet of Things (IoT) environments. The method effectively manages dynamic edge node loads for real-time processing.

Keywords:
Deep Q-Networksconvolutional neural networksdeep reinforcement learninginformerresource schedulingtask offloading

Related Experiment Videos

Area of Science:

  • Smart Manufacturing
  • Industrial Internet of Things (IoT)
  • Edge Computing

Background:

  • Smart manufacturing relies on Industrial Internet of Things (IoT) devices, generating numerous latency-sensitive tasks requiring real-time processing.
  • Dynamic changes in edge node load cause increased latency and task dropouts, posing challenges for cloud-edge-end collaboration.
  • Existing task offloading strategies struggle with unknown edge node loads and dynamic system states.

Purpose of the Study:

  • To propose a distributed algorithm for effective task offloading in smart manufacturing environments.
  • To address the challenges of unknown edge node loads and dynamic system state changes.
  • To optimize task allocation and execution order for latency-sensitive tasks.

Main Methods:

  • A distributed algorithm based on deep reinforcement learning, incorporating Convolutional Neural Networks (CNN) and the Informer architecture.
  • CNN extracts local features of edge node loads; Informer's self-attention captures long-term load trends.
  • Integration of Dueling Deep Q-Network (DQN) and Double DQN for precise state-action value function approximation.

Main Results:

  • The proposed algorithm reduces task dropout rates by 82.3-94%.
  • Average latency is decreased by 28-39.2% compared to existing algorithms.
  • The method demonstrates significant advantages in high-load, latency-sensitive manufacturing scenarios.

Conclusions:

  • The developed deep reinforcement learning algorithm effectively handles dynamic edge node loads and system uncertainties.
  • Independent task offloading decisions by mobile devices enable dynamic task allocation and optimized execution.
  • The algorithm offers a robust solution for real-time processing in smart manufacturing with Industrial IoT.