Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

458
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
458
Reinforcement01:23

Reinforcement

830
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
830
Distributed Loads: Problem Solving01:21

Distributed Loads: Problem Solving

1.1K
Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...
1.1K
Optimization Problems01:26

Optimization Problems

9
Optimization problems often involve identifying maximum or minimum values under specific constraints. A well-known example is determining the longest horizontal pipe that can be moved around a right-angled corner, where a 3-meter-wide hallway meets a 2-meter-wide hallway. This scenario, common in architectural design and industrial transport, can be understood conceptually through geometric and trigonometric reasoning.To visualize the problem, consider the pipe as a straight line that touches...
9
Rolling Resistance: Problem Solving01:17

Rolling Resistance: Problem Solving

778
Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...
778
Optimal Foraging00:48

Optimal Foraging

13.6K
How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.
13.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Corrigendum to "Piezo1 activation mediates stiffness-induced aortic medial calcification: Pharmacological evidence from agonist and antagonist studies" [Eur. J. Pharmacol. 1011 (2026) 178465].

European journal of pharmacology·2026
Same author

Risk factors for severe <i>Chlamydia pneumoniae</i> pneumonia in children: a retrospective case-control study.

Frontiers in pediatrics·2026
Same author

Foundation model for screening severe mitral regurgitation and severe aortic stenosis from coronary angiograms.

Visual computing for industry, biomedicine, and art·2026
Same author

Metabolic hormone and adipokine alterations in major depressive disorder in relation to the acute-phase inflammatory response and early-life adversity.

Communications medicine·2026
Same author

Case Report: Case of severe allergic reaction in a child caused by chlorhexidine skin disinfectant.

Frontiers in allergy·2026
Same author

A simultaneous dual watermarking scheme for deep learning models.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Supporting human-agent communication for explainable planning in spatial-temporal planning problems.

Neural computing & applications·2026
Same journal

Contrastive learning-based video quality assessment-jointed video vision transformer for video recognition.

Neural computing & applications·2026
Same journal

Sequential pattern transformer (SPT): a generative and interpretable framework for predicting disease trajectories.

Neural computing & applications·2026
Same journal

Balancing misclassification errors in image-based inference using problem domain semantics and a nested cascade architecture.

Neural computing & applications·2025
Same journal

A fairness scale for real-time recidivism forecasts using a national database of convicted offenders.

Neural computing & applications·2025
Same journal

Gene expression clock: an unsupervised deep learning approach for predicting circadian rhythmicity from whole genome expression.

Neural computing & applications·2025
See all related articles

Related Experiment Video

Updated: Jan 15, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.4K

Deep multi-objective reinforcement learning for utility-based infrastructural maintenance optimization.

Jesse van Remmerden1, Maurice Kenter2, Diederik M Roijers2,3

  • 1Information Systems IE&IS, Eindhoven University of Technology, De Zaale, 5600 MB Eindhoven, The Netherlands.

Neural Computing & Applications
|October 13, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces multi-objective deep centralized multi-agent actor-critic (MO-DCMAC) for infrastructure maintenance. MO-DCMAC optimizes policies for multiple objectives, outperforming traditional methods in cost and safety assessments.

Keywords:
InfrastructureMaintenanceMulti-objective reinforcement learningReinforcement learning

Related Experiment Videos

Last Updated: Jan 15, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.4K

Area of Science:

  • Artificial Intelligence
  • Civil Engineering
  • Operations Research

Background:

  • Infrastructure maintenance traditionally uses single-objective reinforcement learning (RL), often combining multiple goals like cost and safety into one reward.
  • This reward-shaping can oversimplify complex decision-making processes for asset management.

Purpose of the Study:

  • Introduce multi-objective deep centralized multi-agent actor-critic (MO-DCMAC) for direct multi-objective optimization in infrastructure maintenance.
  • Enable optimization even with nonlinear utility functions, improving upon traditional RL limitations.

Main Methods:

  • Developed MO-DCMAC, a novel multi-objective reinforcement learning approach.
  • Evaluated MO-DCMAC using threshold and Failure Mode, Effects, and Criticality Analysis (FMECA) utility functions.
  • Tested in diverse maintenance environments, including Amsterdam's historical quay walls, comparing against rule-based policies.

Main Results:

  • MO-DCMAC effectively optimizes maintenance policies for multiple objectives simultaneously.
  • Demonstrated superior performance compared to existing rule-based heuristic policies across various scenarios.
  • Validated the method's effectiveness with different utility functions and complex environments.

Conclusions:

  • MO-DCMAC offers a significant advancement over single-objective RL for infrastructure maintenance optimization.
  • The method provides a more robust and effective approach for balancing competing objectives like cost and safety.
  • This research paves the way for more sophisticated and efficient asset management strategies.