Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

402
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
402
Reinforcement Schedules01:24

Reinforcement Schedules

249
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
249
Observational Learning01:12

Observational Learning

349
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
349
Randomized Experiments01:13

Randomized Experiments

8.0K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.0K
Placing Concrete01:17

Placing Concrete

213
The concrete is placed as close as possible to its final position to avoid segregation. The placed concrete is then fully compacted to expel the entrapped air, and the next layer of concrete is laid while the underlying layer is still in the plastic state. The rate at which concrete is placed and compacted is kept equal.
While placing concrete, care is taken to ensure that the concrete is laid in uniform layers, and hand shoveling and moving concrete using poker vibrators is avoided. Also,...
213

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dual-drug-loaded nanohydrogel for intraoperative local application: sequential release-mediated spatiotemporal targeting of diverse secondary injury mechanisms to improve long-term prognosis in traumatic brain injury.

BMC medicine·2026
Same author

Ultra-Stable 2D Magneto-Fluorescent Probe-Mediated Multiplex Immunochromatographic Assay for Precise Bedside Detection of Sepsis.

ACS nano·2026
Same author

A two-stage signal enhancement method integrating the Gaussian mixture model and adaptive rolling ball technique for ultrasensitive fluorescent immunochromatographic detection.

Analytical methods : advancing methods and applications·2026
Same author

An integrated green strategy based on deep eutectic solvents and ultrasound for efficient polyphenol profiling and antioxidant evaluation of dendrobium officinale.

Ultrasonics sonochemistry·2026
Same author

Integrated cervicocerebral ultrasound-based hemodynamic compensation scoring for anterior-circulation steno-occlusive disease: validation against CT perfusion staging.

Quantitative imaging in medicine and surgery·2026
Same author

MRCNet: Motion Reasoning Chain for Cross Modal Video Camouflaged Object Detection.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026
Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026
Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026
Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026
Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026
Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Sep 29, 2025

Place and Response Learning in the Open-field Tower Maze
08:31

Place and Response Learning in the Open-field Tower Maze

Published on: October 28, 2015

10.6K

An Edge Server Placement Method Based on Reinforcement Learning.

Fei Luo1, Shuai Zheng1, Weichao Ding1

  • 1School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China.

Entropy (Basel, Switzerland)
|March 25, 2022
PubMed
Summary
This summary is machine-generated.

A new deep reinforcement learning algorithm, DQN-ESPA, optimizes edge server placement in mobile edge computing. It achieves superior performance over existing methods by considering access delay and workload balance.

Keywords:
access delayedge computingmarkov decision processreinforcement learningworkload balance

More Related Videos

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice
08:59

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

Published on: March 3, 2023

2.2K
Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit
05:30

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

675

Related Experiment Videos

Last Updated: Sep 29, 2025

Place and Response Learning in the Open-field Tower Maze
08:31

Place and Response Learning in the Open-field Tower Maze

Published on: October 28, 2015

10.6K
An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice
08:59

An Open-Source Virtual Reality System for the Measurement of Spatial Learning in Head-Restrained Mice

Published on: March 3, 2023

2.2K
Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit
05:30

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

675

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Network Engineering

Background:

  • Mobile edge computing (MEC) server placement is a complex multi-objective optimization problem.
  • Existing methods like mixed integer programming and heuristic algorithms suffer from scalability issues, local optima, and tuning difficulties.

Purpose of the Study:

  • To propose a novel edge server placement algorithm, DQN-ESPA, based on deep Q-network and reinforcement learning.
  • To achieve optimal edge server placements without prior experience, overcoming limitations of traditional approaches.

Main Methods:

  • Modeling the edge server placement problem as a Markov decision process (MDP).
  • Formalizing the MDP with state space, action space, and reward function.
  • Solving the MDP using a reinforcement learning algorithm (deep Q-network).

Main Results:

  • DQN-ESPA demonstrated superior performance compared to Simulated Annealing Placement Algorithm (SAPA), Top-K Placement Algorithm (TKPA), K-Means Placement Algorithm (KMPA), and Random Placement Algorithm (RPA).
  • Experimental results on Shanghai Telecom datasets showed significant improvements in placement performance.
  • Achieved up to 13.40% and 15.54% better placement for 100 and 300 edge servers, respectively, by considering access delay and workload balance.

Conclusions:

  • DQN-ESPA offers an effective and scalable solution for the edge server placement problem in MEC systems.
  • The reinforcement learning approach enables optimal placements by learning from the environment without relying on previous data or experience.
  • The algorithm provides significant performance gains, particularly in scenarios demanding low access delay and balanced workloads.