Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sampling Plans01:23

Sampling Plans

257
Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...
257
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

100
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
100
Random Sampling Method01:09

Random Sampling Method

12.3K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
12.3K
Systematic Sampling Method01:17

Systematic Sampling Method

11.1K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
Systematic sampling is one of the simplest methods...
11.1K
Systematic Error: Methodological and Sampling Errors01:15

Systematic Error: Methodological and Sampling Errors

2.2K
In the case of systematic errors, the sources can be identified, and the errors can be subsequently minimized by addressing these sources. According to the source, systematic errors can be divided into sampling, instrumental, methodological, and personal errors.
Sampling errors originate from improper sampling methods or the wrong sample population. These errors can be minimized by refining the sampling strategy. Defective instruments or faulty calibrations are the sources of instrumental...
2.2K
Trial and Error and Algorithm01:12

Trial and Error and Algorithm

190
A problem-solving strategy is a plan of action used to find a solution. Different strategies have distinct action plans. Trial and error involves trying different solutions until one works. For instance, to fix a broken printer, you might check ink levels, ensure the paper tray isn't jammed, and verify the printer's connection to your laptop. This method can be time-consuming but is commonly used. Thomas Edison, for example, used trial and error to find a suitable filament for the light...
190

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The myth of digital biomarkers in Alzheimer's disease: how to make them a reality.

Current opinion in psychiatry·2026
Same author

Resource-stratified machine learning framework for cognitive status classification and mild cognitive impairment to dementia progression prediction.

Alzheimer's research & therapy·2026
Same author

Network-Based Epidemic Control Through Optimal Travel and Quarantine Management.

IEEE transactions on control of network systems·2026
Same author

A BEACON for Novel Disease Threats: Leveraging Artificial Intelligence for Informal Event-Based Outbreak Surveillance.

The Journal of infectious diseases·2025
Same author

Developing an accessible dementia assessment tool: Leveraging a residual network, the trail making test, and demographic data.

Journal of Alzheimer's disease : JAD·2025
Same author

One-Shot Averaging for Distributed TD(<i>λ</i>) Under Markov Sampling.

IEEE control systems letters·2025
Same journal

On the control of recurrent neural networks using constant inputs.

IEEE transactions on automatic control·2026
Same journal

Robust Control Barrier Functions for Uncertain Parameter-Varying Control Affine Systems with Set-Membership Parameter Estimation.

IEEE transactions on automatic control·2026
Same journal

Estimation in Networks with Spatiotemporally Correlated Noise.

IEEE transactions on automatic control·2026
Same journal

Transient Analysis of Serial Production Lines With Perishable Products: Bernoulli Reliability Model.

IEEE transactions on automatic control·2024
Same journal

Solid Boundary Output Feedback Control of the Stefan Problem: The Enthalpy Approach.

IEEE transactions on automatic control·2024
Same journal

PAC Reinforcement Learning Algorithm for General-Sum Markov Games.

IEEE transactions on automatic control·2023
See all related articles
  1. Home
  2. Generalized Policy Improvement Algorithms With Theoretically Supported Sample Reuse.
  1. Home
  2. Generalized Policy Improvement Algorithms With Theoretically Supported Sample Reuse.

Related Experiment Video

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse.

James Queeney1, Ioannis Ch Paschalidis2, Christos G Cassandras2

  • 1Mitsubishi Electric Research Laboratories, Cambridge, MA 02139 USA. He performed the majority of this work while with the Division of Systems Engineering, Boston University, Boston, MA 02215 USA.

IEEE Transactions on Automatic Control
|August 20, 2025

View abstract on PubMed

Summary
This summary is machine-generated.

We introduce Generalized Policy Improvement, a new class of model-free deep reinforcement learning algorithms. These algorithms balance performance guarantees with data efficiency for real-world control applications.

Keywords:
Policy improvementpolicy optimizationreinforcement learningsample reuse

More Related Videos

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.0K
The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

11.8K

Related Experiment Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.0K
The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

11.8K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Control Systems

Background:

  • Model-free deep reinforcement learning (RL) is crucial for data-driven control.
  • Existing methods often face a trade-off between performance guarantees and data efficiency.
  • Real-world deployment requires balancing these two critical requirements.

Purpose of the Study:

  • To develop a novel class of model-free deep RL algorithms.
  • To address the performance guarantee vs. data efficiency trade-off in RL.
  • To enhance the applicability of RL in practical control scenarios.

Main Methods:

  • Development of Generalized Policy Improvement (GPI) algorithms.
  • Combining on-policy method guarantees with off-policy sample reuse efficiency.
  • Extensive experimental analysis on diverse simulated control tasks.
  • Main Results:

    • Demonstration of the benefits of the new GPI algorithms.
    • Successful balancing of performance guarantees and data efficiency.
    • Validation across a broad range of simulated control tasks.

    Conclusions:

    • The proposed GPI algorithms represent a significant advancement in model-free deep RL.
    • These algorithms offer a practical solution for real-world control problems.
    • GPI algorithms effectively bridge the gap between theoretical guarantees and practical data efficiency.