Reinforcement learning versus model predictive control: a comparison on a power system problem.

Damien Ernst¹, Mevludin Glavic, Florin Capitanescu

¹Belgian National Fund for Scientific Research, Brussels, Belgium. ernst@montefiore.ulg.ac.be

IEEE Transactions on Systems, Man, and Cybernetics. Part B, Cybernetics : a Publication of the IEEE Systems, Man, and Cybernetics Society

December 20, 2008

Summary

This summary is machine-generated.

This study compares reinforcement learning (RL) and model predictive control (MPC) for electrical power oscillation damping. Results show RL can be competitive with MPC, even when accurate system models are available.

Area of Science:

Control Systems Engineering
Machine Learning
Power Systems

Background:

Electrical power systems require robust controllers to damp oscillations.
Model Predictive Control (MPC) and Reinforcement Learning (RL) are advanced control strategies.

Purpose of the Study:

To compare the performance of RL and MPC in a unified framework.
To evaluate their application in synthesizing a controller for nonlinear electrical power oscillation damping.

Main Methods:

Both MPC and RL were formulated as discrete-time optimal control problems.
MPC utilized an analytical system model and an interior-point solver for open-loop policies.
RL employed a model-free approach, inferring closed-loop policies from system trajectories and cost values via supervised learning.
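
The contrast between the two methods can be sketched on a toy problem. The example below is a minimal illustration under stated assumptions, not the authors' implementation: the one-dimensional system, cost function, discount factor, and all function names are hypothetical. Brute-force enumeration of action sequences stands in for the interior-point solver, and a per-(state, action) average stands in for the supervised learner. The batch scheme shown resembles what is commonly called fitted Q iteration; whether that matches the paper's exact algorithm is an assumption.

```python
import itertools
import random

ACTIONS = (-1, 0, 1)  # hypothetical discrete control set
GAMMA = 0.95          # discount factor (assumed)
X_MAX = 5             # bound on the toy integer state (assumed)


def step(x, u):
    """Toy deterministic dynamics and stage cost (illustrative only)."""
    x_next = max(-X_MAX, min(X_MAX, x + u))
    cost = x * x + 0.1 * abs(u)
    return x_next, cost


def mpc_action(x, horizon=4):
    """Model-based: optimize an open-loop action sequence, apply its first action.

    Exhaustive enumeration is a stand-in for an interior-point solver.
    """
    best_seq, best_cost = None, float("inf")
    for seq in itertools.product(ACTIONS, repeat=horizon):
        xs, total = x, 0.0
        for t, u in enumerate(seq):
            xs, c = step(xs, u)  # MPC queries the analytical model directly
            total += (GAMMA ** t) * c
        if total < best_cost:
            best_seq, best_cost = seq, total
    return best_seq[0]


def fitted_q_iteration(samples, n_iter=50):
    """Model-free: regress Q on targets c + gamma * min_a Q(x', a), repeatedly.

    The "regressor" is a per-(x, u) mean, a stand-in for a real learner.
    """
    q = {}
    for _ in range(n_iter):
        sums, counts = {}, {}
        for x, u, c, x_next in samples:
            target = c + GAMMA * min(q.get((x_next, a), 0.0) for a in ACTIONS)
            sums[(x, u)] = sums.get((x, u), 0.0) + target
            counts[(x, u)] = counts.get((x, u), 0) + 1
        q = {key: sums[key] / counts[key] for key in sums}
    return q


def rl_action(q, x):
    """Closed-loop policy: pick the action with the lowest learned Q-value."""
    return min(ACTIONS, key=lambda a: q.get((x, a), float("inf")))


def rollout(policy, x0, steps=10):
    """Apply a policy for a fixed number of steps; return final state and cost."""
    x, total = x0, 0.0
    for t in range(steps):
        x, c = step(x, policy(x))
        total += (GAMMA ** t) * c
    return x, total


# The RL side only sees sampled transitions (x, u, cost, x_next), never the model.
rng = random.Random(0)
samples = []
for _ in range(2000):
    x = rng.randint(-X_MAX, X_MAX)
    u = rng.choice(ACTIONS)
    x_next, c = step(x, u)
    samples.append((x, u, c, x_next))

q = fitted_q_iteration(samples)
mpc_result = rollout(mpc_action, 4)
rl_result = rollout(lambda x: rl_action(q, x), 4)
```

In this sketch the MPC controller re-solves an optimization at every step using the model, while the RL controller does all its work offline and then acts through a lookup of learned values. On a problem this small, both drive the state to zero along the same trajectory; that says nothing about their relative performance on the power system problem studied in the paper.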

Main Results:

Experimental results were obtained for a nonlinear, deterministic electrical power oscillation damping problem.
The study provides insights into the advantages and disadvantages of both MPC and RL.

Conclusions:

Reinforcement learning demonstrates competitive performance against model predictive control.
RL is a viable alternative even in scenarios where a precise deterministic system model is accessible.