Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

4.9K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
4.9K
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

188
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
188
Maxwell-Boltzmann Distribution: Problem Solving01:20

Maxwell-Boltzmann Distribution: Problem Solving

2.7K
Individual molecules in a gas move in random directions, but a gas containing numerous molecules has a predictable distribution of molecular speeds, which is known as the Maxwell-Boltzmann distribution, f(v).
This distribution function f(v) is defined by saying that the expected number N (v1,v2) of particles with speeds between v1 and v2 is given by
2.7K
Parametric Survival Analysis: Weibull and Exponential Methods01:14

Parametric Survival Analysis: Weibull and Exponential Methods

870
Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...
870
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

346
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
346
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

980
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
980

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Learning to Super-Resolve Face Images via Dual-Domain Multi-scale Feature Interaction.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Effectiveness of heterologous mRNA vaccine boosters during an Omicron wave of COVID-19: a cross-sectional study in Macao (China).

Journal of thoracic disease·2026
Same author

Fast BCIs: Leveraging Dual-Scale Time Windows with Test-Time Adaptation to Enhance Accuracy.

IEEE transactions on bio-medical engineering·2026
Same author

Riemannian Acceleration for Sparse PCA With Separable Structure and Second-Order Information Exploration.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Hierarchical memory-based deep reinforcement learning in simulated survival environments.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Robust Semiglobal and Global Stabilization for Nonlinear Normal Form Systems by Time-Varying Feedback.

IEEE transactions on cybernetics·2026
Same journal

Adaptive Global Asymptotic Output Stabilization of Uncertain Nonlinear Systems Under Dynamic State/Input Quantization.

IEEE transactions on cybernetics·2026
Same journal

Accelerated Distributed Gradient Tracking for Constrained Aggregative Optimization Over Time-Varying Digraphs.

IEEE transactions on cybernetics·2026
Same journal

Small-Gain-Based Plug-and-Play Distributed Control Framework for DC Microgrids With Decentralized Reconfiguration.

IEEE transactions on cybernetics·2026
Same journal

Prescribed-Time Impulsive Control of High-Order Integrator Systems.

IEEE transactions on cybernetics·2026
Same journal

Relaxed Stability Conditions for Model Predictive Control of Hybrid Dynamical Systems Using Hybrid Recurrent Neural Networks.

IEEE transactions on cybernetics·2026
See all related articles

Related Experiment Video

Updated: Dec 6, 2025

A Tactile Automated Passive-Finger Stimulator TAPS
19:44

A Tactile Automated Passive-Finger Stimulator TAPS

Published on: June 3, 2009

14.1K

Inference-Based Posteriori Parameter Distribution Optimization.

Xuesong Wang, Tianyi Li, Yuhu Cheng

    IEEE Transactions on Cybernetics
    |October 7, 2020
    PubMed
    Summary
    This summary is machine-generated.

    The novel inference-based posteriori parameter distribution optimization (IPPDO) algorithm enhances reinforcement learning (RL) exploration by stabilizing parameter distribution learning. This deep RL method improves data efficiency and achieves faster rewards with greater stability.

    More Related Videos

    A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
    12:39

    A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

    Published on: December 10, 2012

    11.6K
    A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
    08:12

    A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

    Published on: March 1, 2022

    2.8K

    Related Experiment Videos

    Last Updated: Dec 6, 2025

    A Tactile Automated Passive-Finger Stimulator TAPS
    19:44

    A Tactile Automated Passive-Finger Stimulator TAPS

    Published on: June 3, 2009

    14.1K
    A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
    12:39

    A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

    Published on: December 10, 2012

    11.6K
    A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
    08:12

    A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

    Published on: March 1, 2022

    2.8K

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Reinforcement Learning

    Background:

    • Encouraging agent exploration is crucial yet challenging in reinforcement learning (RL).
    • Distributional representations can enhance exploration but may cause instability and inefficiency.
    • Existing methods struggle with stable and efficient parameter distribution learning.

    Purpose of the Study:

    • To propose a novel algorithm, inference-based posteriori parameter distribution optimization (IPPDO), for accelerating and stabilizing parameter distribution learning in RL.
    • To design objective functions for both continuous and discrete action tasks based on probability's evidence lower bound.
    • To improve data efficiency in deep RL (DRL) through off-policy learning.

    Main Methods:

    • Developed IPPDO algorithm for parameter distribution optimization using an inference-based approach.
    • Designed specific objective functions for continuous and discrete action spaces.
    • Employed multiple neural networks with Retrace to mitigate value function overestimation.
    • Introduced an activation function on the standard deviation for adaptive weight sampling.
    • Utilized off-policy techniques like experience replay for enhanced data efficiency.

    Main Results:

    • IPPDO demonstrated improved exploration in the action space across continuous and discrete tasks.
    • The algorithm achieved higher rewards more rapidly compared to prevailing DRL algorithms.
    • Experiments on OpenAI Gym and MuJoCo platforms confirmed IPPDO's algorithmic stability.
    • IPPDO effectively balances fixed parameter values and distributional representations.

    Conclusions:

    • IPPDO offers a stable and efficient method for parameter distribution learning in DRL.
    • The algorithm significantly enhances exploration capabilities and learning speed.
    • IPPDO presents a promising advancement for off-policy deep reinforcement learning, improving data efficiency and performance.