Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Distributed Loads: Problem Solving

Distributed Loads: Problem Solving

Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...

Optimization Problems

Optimization Problems

Optimization problems often involve identifying maximum or minimum values under specific constraints. A well-known example is determining the longest horizontal pipe that can be moved around a right-angled corner, where a 3-meter-wide hallway meets a 2-meter-wide hallway. This scenario, common in architectural design and industrial transport, can be understood conceptually through geometric and trigonometric reasoning.To visualize the problem, consider the pipe as a straight line that touches...

Rolling Resistance: Problem Solving

Rolling Resistance: Problem Solving

Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...

Optimal Foraging

Optimal Foraging

How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Corrigendum to "Piezo1 activation mediates stiffness-induced aortic medial calcification: Pharmacological evidence from agonist and antagonist studies" [Eur. J. Pharmacol. 1011 (2026) 178465].

European journal of pharmacology·2026

Same author

Risk factors for severe <i>Chlamydia pneumoniae</i> pneumonia in children: a retrospective case-control study.

Frontiers in pediatrics·2026

Same author

Foundation model for screening severe mitral regurgitation and severe aortic stenosis from coronary angiograms.

Visual computing for industry, biomedicine, and art·2026

Same author

Metabolic hormone and adipokine alterations in major depressive disorder in relation to the acute-phase inflammatory response and early-life adversity.

Communications medicine·2026

Same author

Case Report: Case of severe allergic reaction in a child caused by chlorhexidine skin disinfectant.

Frontiers in allergy·2026

Same author

A simultaneous dual watermarking scheme for deep learning models.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Supporting human-agent communication for explainable planning in spatial-temporal planning problems.

Neural computing & applications·2026

Same journal

Contrastive learning-based video quality assessment-jointed video vision transformer for video recognition.

Neural computing & applications·2026

Same journal

Sequential pattern transformer (SPT): a generative and interpretable framework for predicting disease trajectories.

Neural computing & applications·2026

Same journal

Balancing misclassification errors in image-based inference using problem domain semantics and a nested cascade architecture.

Neural computing & applications·2025

Same journal

A fairness scale for real-time recidivism forecasts using a national database of convicted offenders.

Neural computing & applications·2025

Same journal

Gene expression clock: an unsupervised deep learning approach for predicting circadian rhythmicity from whole genome expression.

Neural computing & applications·2025

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 15, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Deep multi-objective reinforcement learning for utility-based infrastructural maintenance optimization.

Jesse van Remmerden¹, Maurice Kenter², Diederik M Roijers^2,3

¹Information Systems IE&IS, Eindhoven University of Technology, De Zaale, 5600 MB Eindhoven, The Netherlands.

Neural Computing & Applications

|October 13, 2025

Summary

This summary is machine-generated.

This study introduces multi-objective deep centralized multi-agent actor-critic (MO-DCMAC) for infrastructure maintenance. MO-DCMAC optimizes policies for multiple objectives, outperforming traditional methods in cost and safety assessments.

Keywords:

Infrastructure Maintenance Multi-objective reinforcement learning Reinforcement learning

Related Experiment Videos

Last Updated: Jan 15, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Area of Science:

Artificial Intelligence
Civil Engineering
Operations Research

Background:

Infrastructure maintenance traditionally uses single-objective reinforcement learning (RL), often combining multiple goals like cost and safety into one reward.
This reward-shaping can oversimplify complex decision-making processes for asset management.

Purpose of the Study:

Introduce multi-objective deep centralized multi-agent actor-critic (MO-DCMAC) for direct multi-objective optimization in infrastructure maintenance.
Enable optimization even with nonlinear utility functions, improving upon traditional RL limitations.

Main Methods:

Developed MO-DCMAC, a novel multi-objective reinforcement learning approach.
Evaluated MO-DCMAC using threshold and Failure Mode, Effects, and Criticality Analysis (FMECA) utility functions.
Tested in diverse maintenance environments, including Amsterdam's historical quay walls, comparing against rule-based policies.

Main Results:

MO-DCMAC effectively optimizes maintenance policies for multiple objectives simultaneously.
Demonstrated superior performance compared to existing rule-based heuristic policies across various scenarios.
Validated the method's effectiveness with different utility functions and complex environments.

Conclusions:

MO-DCMAC offers a significant advancement over single-objective RL for infrastructure maintenance optimization.
The method provides a more robust and effective approach for balancing competing objectives like cost and safety.
This research paves the way for more sophisticated and efficient asset management strategies.