Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

804
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
804
Reinforcement Schedules01:24

Reinforcement Schedules

440
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
440
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

1.3K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
1.3K
Observational Learning01:12

Observational Learning

804
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
804
Differential Leveling01:12

Differential Leveling

647
Differential leveling is a precise method in surveying used to determine the elevation difference between two points. Its primary goal is to establish accurate vertical measurements to create level surfaces or grade lines critical for designing and constructing infrastructures such as roads, bridges, and buildings.The procedure for differential leveling begins with setting up and leveling the instrument at a point where the benchmark can be seen. The level rod is held on the benchmark (BM), and...
647
Law of Effect01:06

Law of Effect

2.4K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
2.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Formation-Constrained Cooperative Localization for UAV Swarms in GNSS-Denied Environments.

Sensors (Basel, Switzerland)·2026
Same author

Kruppel-Like Factor 5 Modulates the Nuclear Factor Erythroid-2-Related Factor 2/Heme Oxygenase 1 Signalling Pathway to Regulate Vascular Smooth Muscle Cell Ferroptosis in Abdominal Aortic Aneurysm.

Clinical and experimental pharmacology & physiology·2026
Same author

Interpretable LASSO-Cox model: Hsp90α/albumin ratio predicts hepatocellular carcinoma prognosis.

Translational cancer research·2026
Same author

High-fat diet induces senescence in ADSCs via CDK4 ubiquitination-mediated cell cycle disruption, contributing to impaired glucose tolerance.

Molecular metabolism·2025
Same author

PROTAC repurposing uncovers a noncanonical binding surface that mediates chemical degradation of nuclear receptors.

Nature communications·2025
Same author

Syphilitic Descending Aortic Pseudoaneurysm.

European journal of vascular and endovascular surgery : the official journal of the European Society for Vascular Surgery·2025
Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026
Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026
Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026
Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026
Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026
Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jan 12, 2026

Author Spotlight: Advancing Protein Engineering – Harnessing Evolution Through PRANCE and Lab Automation
05:08

Author Spotlight: Advancing Protein Engineering – Harnessing Evolution Through PRANCE and Lab Automation

Published on: January 12, 2024

2.2K

An improved differential evolution algorithm based on reinforcement learning and its application.

Guangwei Yang1,2, Peng Sun1, Jieyong Zhang3

  • 1Information and Navigation College, Air Force Engineering University, Xi'an 710077, China.

Scientific Reports
|November 6, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel reinforcement learning-based Differential Evolution (RLDE) algorithm to overcome parameter sensitivity and premature convergence in swarm intelligence optimization. RLDE demonstrates superior global optimization performance on complex, high-dimensional problems.

Keywords:
Differential evolution algorithmHalton sequenceHierarchical sortingPolicy gradient networkReinforcement learningTask assignment

More Related Videos

Procedure for Adaptive Laboratory Evolution of Microorganisms Using a Chemostat
06:03

Procedure for Adaptive Laboratory Evolution of Microorganisms Using a Chemostat

Published on: September 20, 2016

15.1K

Related Experiment Videos

Last Updated: Jan 12, 2026

Author Spotlight: Advancing Protein Engineering – Harnessing Evolution Through PRANCE and Lab Automation
05:08

Author Spotlight: Advancing Protein Engineering – Harnessing Evolution Through PRANCE and Lab Automation

Published on: January 12, 2024

2.2K
Procedure for Adaptive Laboratory Evolution of Microorganisms Using a Chemostat
06:03

Procedure for Adaptive Laboratory Evolution of Microorganisms Using a Chemostat

Published on: September 20, 2016

15.1K

Area of Science:

  • Computational Intelligence
  • Optimization Algorithms
  • Swarm Intelligence

Background:

  • Differential Evolution (DE) is a powerful swarm intelligence method for high-dimensional problems.
  • DE suffers from parameter sensitivity and premature convergence, limiting its practical application.
  • Existing optimization methods require improvements in efficiency and adaptability.

Purpose of the Study:

  • To propose an improved Differential Evolution algorithm, RLDE, leveraging reinforcement learning.
  • To enhance the global optimization performance and address limitations of the standard DE algorithm.
  • To validate the algorithm's effectiveness on benchmark functions and a real-world engineering problem.

Main Methods:

  • Population initialization using Halton sequence for improved ergodicity.
  • Dynamic parameter adjustment via a reinforcement learning policy gradient network for adaptive scaling factor and crossover probability.
  • Differentiated mutation strategy based on population fitness classification.

Main Results:

  • RLDE significantly improved global optimization performance across 26 standard test functions.
  • The algorithm outperformed multiple heuristic optimization algorithms in 10, 30, and 50 dimensions.
  • Successful application to the Unmanned Aerial Vehicle (UAV) task assignment problem demonstrated practical engineering value.

Conclusions:

  • The proposed RLDE algorithm effectively addresses DE's parameter sensitivity and premature convergence.
  • RLDE offers enhanced global optimization capabilities for complex, high-dimensional problems.
  • The algorithm shows significant potential for real-world engineering applications, such as UAV task assignment.