Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Randomized Experiments

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...

Placing Concrete

Placing Concrete

The concrete is placed as close as possible to its final position to avoid segregation. The placed concrete is then fully compacted to expel the entrapped air, and the next layer of concrete is laid while the underlying layer is still in the plastic state. The rate at which concrete is placed and compacted is kept equal.
While placing concrete, care is taken to ensure that the concrete is laid in uniform layers, and hand shoveling and moving concrete using poker vibrators is avoided. Also,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Dual-drug-loaded nanohydrogel for intraoperative local application: sequential release-mediated spatiotemporal targeting of diverse secondary injury mechanisms to improve long-term prognosis in traumatic brain injury.

BMC medicine·2026

Same author

Ultra-Stable 2D Magneto-Fluorescent Probe-Mediated Multiplex Immunochromatographic Assay for Precise Bedside Detection of Sepsis.

ACS nano·2026

Same author

A two-stage signal enhancement method integrating the Gaussian mixture model and adaptive rolling ball technique for ultrasensitive fluorescent immunochromatographic detection.

Analytical methods : advancing methods and applications·2026

Same author

An integrated green strategy based on deep eutectic solvents and ultrasound for efficient polyphenol profiling and antioxidant evaluation of dendrobium officinale.

Ultrasonics sonochemistry·2026

Same author

Integrated cervicocerebral ultrasound-based hemodynamic compensation scoring for anterior-circulation steno-occlusive disease: validation against CT perfusion staging.

Quantitative imaging in medicine and surgery·2026

Same author

MRCNet: Motion Reasoning Chain for Cross Modal Video Camouflaged Object Detection.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026

Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026

Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026

Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026

Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026

Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 29, 2025

Place and Response Learning in the Open-field Tower Maze

Place and Response Learning in the Open-field Tower Maze

Published on: October 28, 2015

An Edge Server Placement Method Based on Reinforcement Learning.

Fei Luo¹, Shuai Zheng¹, Weichao Ding¹

¹School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China.

Entropy (Basel, Switzerland)

|March 25, 2022

Summary

This summary is machine-generated.

A new deep reinforcement learning algorithm, DQN-ESPA, optimizes edge server placement in mobile edge computing. It achieves superior performance over existing methods by considering access delay and workload balance.

Keywords:

access delay edge computing markov decision process reinforcement learning workload balance

More Related Videos

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

Published on: March 3, 2023

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

Related Experiment Videos

Last Updated: Sep 29, 2025

Place and Response Learning in the Open-field Tower Maze

Place and Response Learning in the Open-field Tower Maze

Published on: October 28, 2015

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

Published on: March 3, 2023

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

Area of Science:

Computer Science
Artificial Intelligence
Network Engineering

Background:

Mobile edge computing (MEC) server placement is a complex multi-objective optimization problem.
Existing methods like mixed integer programming and heuristic algorithms suffer from scalability issues, local optima, and tuning difficulties.

Purpose of the Study:

To propose a novel edge server placement algorithm, DQN-ESPA, based on deep Q-network and reinforcement learning.
To achieve optimal edge server placements without prior experience, overcoming limitations of traditional approaches.

Main Methods:

Modeling the edge server placement problem as a Markov decision process (MDP).
Formalizing the MDP with state space, action space, and reward function.
Solving the MDP using a reinforcement learning algorithm (deep Q-network).

Main Results:

DQN-ESPA demonstrated superior performance compared to Simulated Annealing Placement Algorithm (SAPA), Top-K Placement Algorithm (TKPA), K-Means Placement Algorithm (KMPA), and Random Placement Algorithm (RPA).
Experimental results on Shanghai Telecom datasets showed significant improvements in placement performance.
Achieved up to 13.40% and 15.54% better placement for 100 and 300 edge servers, respectively, by considering access delay and workload balance.

Conclusions:

DQN-ESPA offers an effective and scalable solution for the edge server placement problem in MEC systems.
The reinforcement learning approach enables optimal placements by learning from the environment without relying on previous data or experience.
The algorithm provides significant performance gains, particularly in scenarios demanding low access delay and balanced workloads.