An improved differential evolution algorithm based on reinforcement learning and its application.

Guangwei Yang¹,², Peng Sun¹, Jieyong Zhang³

¹Information and Navigation College, Air Force Engineering University, Xi'an 710077, China.

Scientific Reports, November 6, 2025

Summary

This summary is machine-generated.

This study introduces a novel reinforcement learning-based Differential Evolution (RLDE) algorithm to overcome parameter sensitivity and premature convergence in swarm intelligence optimization. RLDE demonstrates superior global optimization performance on complex, high-dimensional problems.

Keywords:

Differential evolution algorithm; Halton sequence; Hierarchical sorting; Policy gradient network; Reinforcement learning; Task assignment

Area of Science:

Computational Intelligence
Optimization Algorithms
Swarm Intelligence

Background:

Differential Evolution (DE) is a powerful swarm intelligence method for high-dimensional problems.
DE suffers from parameter sensitivity and premature convergence, limiting its practical application.
Existing optimization methods require improvements in efficiency and adaptability.
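
For orientation, the baseline that the study sets out to improve can be sketched as the classic DE/rand/1/bin scheme. This is a minimal illustration rather than the authors' code: the function names `sphere` and `de_rand_1_bin`, the default values `F=0.5` and `CR=0.9`, and the clamping of trial vectors to the bounds are all assumptions made here. Note that the scaling factor `F` and crossover probability `CR` stay fixed for the whole run, which is the parameter sensitivity the Background points describe.

```python
import random


def sphere(x):
    """Simple benchmark: sum of squares, with minimum 0 at the origin."""
    return sum(v * v for v in x)


def de_rand_1_bin(fitness, bounds, pop_size=20, F=0.5, CR=0.9,
                  generations=200, seed=0):
    """Minimise `fitness` with classic DE/rand/1/bin (illustrative defaults)."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Uniform random initialisation inside the search bounds.
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [fitness(ind) for ind in pop]

    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct individuals, all different from the target i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for j, (lo, hi) in enumerate(bounds):
                if rng.random() < CR or j == j_rand:
                    # Mutation: base vector plus scaled difference vector.
                    v = pop[a][j] + F * (pop[b][j] - pop[c][j])
                    v = min(max(v, lo), hi)  # clamp to bounds (an assumption)
                else:
                    v = pop[i][j]  # binomial crossover keeps the parent gene
                trial.append(v)
            # Greedy selection: the trial replaces the target if it is no worse.
            trial_score = fitness(trial)
            if trial_score <= scores[i]:
                pop[i], scores[i] = trial, trial_score

    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]
```

Calling `de_rand_1_bin(sphere, [(-5.0, 5.0)] * 5)` returns the best vector found and its fitness. Re-running with different fixed `F` and `CR` values is a quick way to see how strongly the result depends on them.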

Purpose of the Study:

To propose an improved Differential Evolution algorithm, RLDE, leveraging reinforcement learning.
To enhance the global optimization performance and address limitations of the standard DE algorithm.
To validate the algorithm's effectiveness on benchmark functions and a real-world engineering problem.

Main Methods:

Population initialization using Halton sequence for improved ergodicity.
Dynamic parameter adjustment via a reinforcement learning policy gradient network for adaptive scaling factor and crossover probability.
Differentiated mutation strategy based on population fitness classification.
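
Two of the methods listed above can be illustrated with a rough sketch under stated assumptions. The first part builds an initial population from a Halton sequence, using one prime base per dimension. The second part is a deliberately simplified stand-in for the policy gradient network: a softmax policy with one logit per candidate (F, CR) pair, updated with the REINFORCE rule. The names `halton_population` and `SoftmaxPolicy`, the candidate parameter pairs, the learning rate, and the reward definition are hypothetical and are not taken from the paper.

```python
import math
import random


def first_primes(n):
    """Return the first n primes, used as Halton bases (one per dimension)."""
    primes, candidate = [], 2
    while len(primes) < n:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def radical_inverse(index, base):
    """Mirror the base-`base` digits of `index` around the radix point."""
    result, frac = 0.0, 1.0 / base
    while index > 0:
        result += (index % base) * frac
        index //= base
        frac /= base
    return result


def halton_population(pop_size, bounds):
    """Spread pop_size individuals over `bounds` with a Halton sequence."""
    bases = first_primes(len(bounds))
    return [
        [lo + radical_inverse(i, bases[d]) * (hi - lo)
         for d, (lo, hi) in enumerate(bounds)]
        for i in range(1, pop_size + 1)  # index 0 would map to the lower corner
    ]


class SoftmaxPolicy:
    """Simplified stand-in for a policy network: one logit per (F, CR) pair."""

    def __init__(self, actions, lr=0.1, seed=0):
        self.actions = actions            # hypothetical candidate (F, CR) pairs
        self.logits = [0.0] * len(actions)
        self.lr = lr
        self.rng = random.Random(seed)

    def probs(self):
        m = max(self.logits)              # subtract the max for stability
        exps = [math.exp(v - m) for v in self.logits]
        total = sum(exps)
        return [e / total for e in exps]

    def sample(self):
        """Draw the index of an (F, CR) pair according to the current policy."""
        return self.rng.choices(range(len(self.actions)), weights=self.probs())[0]

    def update(self, action, reward):
        """REINFORCE step: logits += lr * reward * grad of log pi(action)."""
        p = self.probs()
        for k in range(len(self.logits)):
            grad = (1.0 if k == action else 0.0) - p[k]
            self.logits[k] += self.lr * reward * grad
```

In a DE loop, one plausible use is to call `sample()` once per generation, run the generation with the chosen pair, and then call `update()` with a reward such as the improvement in best fitness. That reward choice is an assumption here, not the paper's definition.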

Main Results:

RLDE significantly improved global optimization performance across 26 standard test functions.
The algorithm outperformed multiple heuristic optimization algorithms in 10, 30, and 50 dimensions.
Successful application to the Unmanned Aerial Vehicle (UAV) task assignment problem demonstrated practical engineering value.

Conclusions:

The proposed RLDE algorithm effectively addresses DE's parameter sensitivity and premature convergence.
RLDE offers enhanced global optimization capabilities for complex, high-dimensional problems.
The algorithm shows significant potential for real-world engineering applications, such as UAV task assignment.