A New Accelerated Off-Policy Stochastic Preconditioned TD(0) Algorithm.

Weidong Liu, Jiahua Ma, Xiaojun Mao

IEEE Transactions on Pattern Analysis and Machine Intelligence

May 23, 2025

Summary

This summary is machine-generated.

We introduce Stochastic Preconditioned Temporal Difference (SPTD), a novel method for policy evaluation in off-policy reinforcement learning. SPTD achieves the optimal O(1/t) convergence rate and outperforms existing techniques in extensive numerical experiments.

Area of Science:

Artificial Intelligence
Machine Learning
Reinforcement Learning

Background:

Off-policy policy evaluation is crucial for reinforcement learning.
Existing methods often lack optimal convergence rates.
Linear function approximation is widely used in RL.
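
As a point of reference, the setting named above is usually written as follows. This is standard textbook notation for linear value approximation and the importance-weighted off-policy TD(0) update, not notation taken from the paper: the feature map \phi, weight vector \theta, target policy \pi, behavior policy \mu, discount \gamma, and step size \alpha_t are all assumed symbols.

```latex
\begin{align}
V_\theta(s) &= \phi(s)^\top \theta, \\
\rho_t &= \frac{\pi(a_t \mid s_t)}{\mu(a_t \mid s_t)}, \\
\delta_t &= r_t + \gamma\, \phi(s_{t+1})^\top \theta_t - \phi(s_t)^\top \theta_t, \\
\theta_{t+1} &= \theta_t + \alpha_t\, \rho_t\, \delta_t\, \phi(s_t).
\end{align}
```

Here \rho_t reweights each sample so that data collected under \mu can be used to evaluate \pi, and \delta_t is the temporal-difference error.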

Purpose of the Study:

To propose a novel procedure for off-policy policy evaluation.
To achieve the optimal convergence rate under linear function approximation.
To analyze the finite-sample rates and asymptotic distribution.

Main Methods:

Stochastic Preconditioned Temporal Difference (SPTD) algorithm.
Analysis under Markovian sampling for differing policies.
Derivation of finite-sample rates and asymptotic distribution.
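
For orientation, the sketch below shows plain importance-weighted off-policy TD(0) with linear features, the kind of baseline that a preconditioned method would aim to accelerate. It is not the SPTD algorithm, whose update rule is not given in this summary. The toy two-state chain, the one-hot features, the constant step size, and the function names `td0_update` and `evaluate_stay_policy` are all assumptions made for illustration.

```python
import random


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def td0_update(theta, phi_s, phi_next, reward, rho, gamma, alpha):
    """One importance-weighted TD(0) step; returns the new weight vector."""
    # Temporal-difference error under the current linear value estimate.
    delta = reward + gamma * dot(phi_next, theta) - dot(phi_s, theta)
    # rho = pi(a|s) / mu(a|s) corrects for sampling from the behavior policy.
    return [w + alpha * rho * delta * x for w, x in zip(theta, phi_s)]


def evaluate_stay_policy(num_steps=20000, gamma=0.9, alpha=0.05, seed=0):
    """Estimate the value of an 'always stay' target policy from data
    generated by a uniformly random behavior policy on a toy 2-state chain
    (a hypothetical environment, chosen so the true values are known)."""
    rng = random.Random(seed)
    features = [[1.0, 0.0], [0.0, 1.0]]  # one-hot (tabular) features
    theta = [0.0, 0.0]
    state = 0
    for _ in range(num_steps):
        action = rng.randrange(2)  # behavior policy: 0 = stay, 1 = switch
        if action == 0:
            # Staying pays reward 1 in state 0 and reward 0 in state 1.
            next_state, reward = state, (1.0 if state == 0 else 0.0)
        else:
            next_state, reward = 1 - state, 0.0
        # Target picks 'stay' with probability 1, behavior with probability 1/2.
        rho = 2.0 if action == 0 else 0.0
        theta = td0_update(theta, features[state], features[next_state],
                           reward, rho, gamma, alpha)
        state = next_state
    return theta


if __name__ == "__main__":
    print(evaluate_stay_policy())
```

Under the target policy the true values in this toy chain are 1 / (1 - 0.9) = 10 for state 0 and 0 for state 1, so the learned weights should approach those numbers. One-hot features are used deliberately: with general linear features, plain off-policy TD(0) can diverge, which is part of the motivation for more careful algorithms.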

Main Results:

SPTD achieves the optimal O(1/t) convergence rate in mean squared error (MSE).
Computational complexity is linear in the dimension of the feature space.
Provides the first results on the asymptotic distribution and on a near-optimal step size, α_t = O(t^{-2/3}).
Uniformly outperforms existing methods in numerical experiments.

Conclusions:

SPTD offers a theoretically optimal and practically superior approach to off-policy policy evaluation.
The method demonstrates strong performance in both on-policy and off-policy settings.
SPTD advances the state-of-the-art in reinforcement learning evaluation.