Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Distributed Loads: Problem Solving

Distributed Loads: Problem Solving

Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Avoidance Learning and Learned Helplessness

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Localization Performance Analysis and Algorithm Design of Reconfigurable Intelligent Surface-Assisted D2D Systems.

Sensors (Basel, Switzerland)·2024

Same author

Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning.

Sensors (Basel, Switzerland)·2023

Same author

Grassland ecology system: A critical reservoir and dissemination medium of antibiotic resistance in Xilingol Pasture, Inner Mongolia.

The Science of the total environment·2021

Same author

Efficacy and Safety of First-Line Chemotherapies for Patients With Advanced Biliary Tract Carcinoma: A Systematic Review and Network Meta-Analysis.

Frontiers in oncology·2021

Same author

Quantitation of plasma metanephrines using isotope dilution liquid chromatography tandem mass spectrometry (ID-LC/MS/MS): a candidate reference measurement procedure and its application to evaluating routine ID-LC/MS/MS methods.

Analytical and bioanalytical chemistry·2021

Same author

Phosphorylation of CAP1 regulates lung cancer proliferation, migration, and invasion.

Journal of cancer research and clinical oncology·2021

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 27, 2025

Author Spotlight: Enhancing Upper Limb Rehabilitation in Stroke Patients Through Advanced Robotic and Neuromodulation Technologies

Author Spotlight: Enhancing Upper Limb Rehabilitation in Stroke Patients Through Advanced Robotic and Neuromodulation Technologies

Published on: October 11, 2024

D2D-Assisted Multi-User Cooperative Partial Offloading in MEC Based on Deep Reinforcement Learning.

Xin Guan¹, Tiejun Lv¹, Zhipeng Lin²

¹School of Information and Communication Engineering, Beijing University of Posts and Telecommunications (BUPT), Beijing 100876, China.

Sensors (Basel, Switzerland)

|September 23, 2022

Summary

This summary is machine-generated.

This study introduces a cooperative framework combining mobile edge computing (MEC) and device-to-device (D2D) communication to enhance mobile device capabilities. A deep reinforcement learning approach maximizes task computation under delay and resource constraints.

Keywords:

D2D communication Q learning deep Q-network mobile edge computing partial offloading

More Related Videos

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

A Flexible Platform for Monitoring Cerebellum-Dependent Sensory Associative Learning

A Flexible Platform for Monitoring Cerebellum-Dependent Sensory Associative Learning

Published on: January 19, 2022

Related Experiment Videos

Last Updated: Aug 27, 2025

Author Spotlight: Enhancing Upper Limb Rehabilitation in Stroke Patients Through Advanced Robotic and Neuromodulation Technologies

Author Spotlight: Enhancing Upper Limb Rehabilitation in Stroke Patients Through Advanced Robotic and Neuromodulation Technologies

Published on: October 11, 2024

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

A Flexible Platform for Monitoring Cerebellum-Dependent Sensory Associative Learning

A Flexible Platform for Monitoring Cerebellum-Dependent Sensory Associative Learning

Published on: January 19, 2022

Area of Science:

Computer Science
Electrical Engineering
Wireless Communications

Background:

Mobile devices face limitations in computing power and battery life.
Mobile Edge Computing (MEC) and Device-to-Device (D2D) communication offer solutions by offloading tasks and enabling local resource sharing.
Existing frameworks often struggle with efficient resource allocation and task offloading in multi-user scenarios.

Purpose of the Study:

To develop a novel D2D-MEC framework for cooperative partial offloading and computing resource allocation.
To maximize the number of devices served within application delay constraints and limited edge computing resources.
To address the NP-hard optimization problem of resource allocation in a D2D-MEC system.

Main Methods:

Formulation of the multi-user cooperative partial offloading and resource allocation problem.
Decoupling the NP-hard problem into two subproblems.
Application of convex optimization for the first subproblem.
Modeling the second subproblem as a Markov Decision Process (MDP).
Development of a Deep Q Network (DQN) algorithm for task computation maximization.

Main Results:

The proposed D2D-MEC framework effectively manages task offloading and resource allocation.
The integrated approach significantly enhances the system's ability to compute tasks under strict delay and resource limitations.
Simulation results validate the superiority and effectiveness of the developed deep reinforcement learning-based scheme compared to existing methods.

Conclusions:

The cooperative D2D-MEC framework provides a robust solution for mobile device resource constraints.
Deep reinforcement learning, specifically DQN, is highly effective in optimizing task offloading and resource allocation in complex MEC environments.
The proposed scheme demonstrates significant performance gains in maximizing computational capacity while adhering to critical application deadlines.