Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Comparison between RL and RC circuits01:24

Comparison between RL and RC circuits

An RC circuit consists of resistance and capacitance, while in an RL circuit, capacitance is replaced by an inductor. RL and RC circuits are first-order differential circuits that store energy. An RC circuit stores energy in the electric field, while an RL circuit stores energy in the magnetic field. When connected to a battery, an RC circuit charges the capacitor, causing the current to decrease from maximum to zero upon being fully charged. This increases the voltage across the capacitor from...
Feedback control systems01:26

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
Open and closed-loop control systems01:17

Open and closed-loop control systems

Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal and...
Control Systems01:10

Control Systems

Control systems are everywhere in contemporary society, influencing diverse applications from aerospace to automated manufacturing. These systems can be found naturally within biological processes, such as blood sugar regulation and heart rate adjustment in response to stress, as well as in man-made systems like elevators and automated vehicles. A control system is essentially a network of subsystems and processes that collaboratively convert specific inputs into desired outputs.
At the heart...
The Power Flow Problem and Solution01:26

The Power Flow Problem and Solution

Power flow problem analysis is fundamental for determining real and reactive power flows in network components, such as transmission lines, transformers, and loads. The power system's single-line diagram provides data on the bus, transmission line, and transformer. Each bus k in the system is characterized by four key variables: voltage magnitude Vk​, phase angle δk​, real power Pk​, and reactive power Qk​. Two of these four variables are inputs, while the power flow program computes the...
Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Fast reconstruction of degenerate populations of conductance-based neuron models from spike times.

PLoS computational biology·2026
Same author

Launching Austria's One Health network: paving the way for transdisciplinary collaborations.

One health outlook·2024
Same author

Warming up recurrent neural networks to maximise reachable multistability greatly improves learning.

Neural networks : the official journal of the International Neural Network Society·2023
Same author

Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments.

Sensors (Basel, Switzerland)·2022
Same author

A bio-inspired bistable recurrent cell allows for long-lasting memory.

PloS one·2021
Same author

The impact of different COVID-19 containment measures on electricity consumption in Europe.

Energy research & social science·2020
Same journal

Strategic Ability Updating in Concurrent Games by Coalitional Commitment.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2015
Same journal

Meta-Analysis of the First Facial Expression Recognition Challenge.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

Adjustable model-based fusion method for multispectral and panchromatic images.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

Face Feature Weighted Fusion Based on Fuzzy Membership Degree for Video Face Recognition.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

A New Adaptive Fast Cellular Automaton Neighborhood Detection and Rule Identification Algorithm.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
Same journal

Human-arm-and-hand-dynamic model with variability analyses for a stylus-based haptic interface.

IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society·2012
See all related articles

Related Experiment Videos

Reinforcement learning versus model predictive control: a comparison on a power system problem.

Damien Ernst1, Mevludin Glavic, Florin Capitanescu

  • 1Belgian National Fund for Scientific Research, Brussels, Belgium. ernst@montefiore.ulg.ac.be

IEEE Transactions on Systems, Man, and Cybernetics. Part B, Cybernetics : a Publication of the IEEE Systems, Man, and Cybernetics Society
|December 20, 2008
PubMed
Summary
This summary is machine-generated.

This study compares reinforcement learning (RL) and model predictive control (MPC) for electrical power oscillation damping. Results show RL can be competitive with MPC, even when accurate system models are available.

Related Experiment Videos

Area of Science:

  • Control Systems Engineering
  • Machine Learning
  • Power Systems

Background:

  • Electrical power systems require robust controllers to damp oscillations.
  • Model Predictive Control (MPC) and Reinforcement Learning (RL) are advanced control strategies.

Purpose of the Study:

  • To compare the performance of RL and MPC in a unified framework.
  • To evaluate their application in synthesizing a controller for nonlinear electrical power oscillation damping.

Main Methods:

  • Both MPC and RL were formulated as discrete-time optimal control problems.
  • MPC utilized an analytical system model and an interior-point solver for open-loop policies.
  • RL employed a model-free approach, inferring closed-loop policies from system trajectories and cost values via supervised learning.

Main Results:

  • Experimental results were obtained for a nonlinear, deterministic electrical power oscillation damping problem.
  • The study provides insights into the advantages and disadvantages of both MPC and RL.

Conclusions:

  • Reinforcement learning demonstrates competitive performance against model predictive control.
  • RL is a viable alternative even in scenarios where a precise deterministic system model is accessible.