Methods for correcting inference based on outcomes predicted by machine learning.

Siruo Wang¹, Tyler H McCormick²,³, Jeffrey T Leek⁴

¹Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205.

Proceedings of the National Academy of Sciences of the United States of America

November 19, 2020

Summary

This summary is machine-generated.

This study introduces postprediction inference (postpi), a method for correcting statistical inference when the outcomes being analyzed were predicted by a machine learning model rather than observed. The postpi approach improves the accuracy of inference drawn from predicted outcomes in medical and public health research.

Keywords:

interpretability, machine learning, postprediction inference, statistics

Area of Science:

Biostatistics
Machine Learning
Computational Biology

Background:

Machine learning models are increasingly used for outcome prediction in medicine and public health.
Statistical inference often fails to account for the difference between observed and predicted outcomes, leading to potential biases.
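
One way to see where the bias can come from is a short worked equation. The setup below is an illustrative assumption, not notation taken from the paper: a simple linear inference model for the observed outcome, with the prediction error written as a separate term (requires amsmath).

```latex
% Illustrative symbols (assumed, not from the source):
%   y_i       observed outcome,   x_i  covariate of interest
%   \hat y_i  predicted outcome,  \delta_i = y_i - \hat y_i  prediction error
\begin{align*}
y_i &= \beta_0 + \beta_1 x_i + e_i, \qquad \operatorname{Cov}(x_i, e_i) = 0,\\
\hat y_i &= y_i - \delta_i,\\
\frac{\operatorname{Cov}(x_i, \hat y_i)}{\operatorname{Var}(x_i)}
  &= \beta_1 - \frac{\operatorname{Cov}(x_i, \delta_i)}{\operatorname{Var}(x_i)}.
\end{align*}
```

Under these assumptions, the slope obtained by regressing predictions on the covariate equals the target slope only when the prediction error is uncorrelated with that covariate. The scatter around the fitted line is also driven by e_i - δ_i rather than e_i alone, so standard errors computed from predicted outcomes need not match those from observed outcomes.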

Purpose of the Study:

To develop and validate a method for correcting statistical inference when using predicted outcomes from complex machine learning models.
To improve the accuracy of variance estimation and subsequent statistical analyses in postprediction scenarios.

Main Methods:

Developed a postprediction inference (postpi) approach that models the relationship between observed and predicted outcomes.
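Used a training, testing, and validation framework: trained the prediction model on the training set, estimated the relationship between observed and predicted outcomes on the testing set, and used that relationship to correct inference on the validation set.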
Applied the method to diverse datasets, including gene expression and verbal autopsy data.
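
The three-set workflow described above can be sketched in a few lines of Python. This is a simplified illustration on simulated data, not the authors' implementation or the API of their R package: the data-generating model, the linear prediction model, the linear relationship model, the bootstrap size, and every name here (`ols`, `predict`, `gamma`, `sigma_rel`) are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

def ols(X, y):
    """Least-squares fit with intercept; returns (coefficients, standard errors, residual SD)."""
    X = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / (len(y) - X.shape[1])
    cov = sigma2 * np.linalg.inv(X.T @ X)
    return beta, np.sqrt(np.diag(cov)), np.sqrt(sigma2)

# Simulated data (assumed for illustration): the true slope on X[:, 0] is 2.0.
n = 3000
X = rng.normal(size=(n, 3))
y = 1.0 + 2.0 * X[:, 0] + np.sin(X[:, 1]) + X[:, 2] + rng.normal(scale=2.0, size=n)
train, test, valid = np.split(rng.permutation(n), [1000, 2000])

# Step 1 (training set): fit the prediction model. A linear model stands in
# for whatever machine learning model would be used in practice.
w, _, _ = ols(X[train], y[train])

def predict(A):
    return np.column_stack([np.ones(len(A)), A]) @ w

# Step 2 (testing set): model observed outcomes as a function of predictions.
gamma, _, sigma_rel = ols(predict(X[test]), y[test])

# Step 3 (validation set): only predictions are treated as available.
x_valid = X[valid, 0]
yp_valid = predict(X[valid])

# Naive inference: regress the predictions themselves on the covariate.
b_naive, se_naive_all, _ = ols(x_valid, yp_valid)
se_naive = se_naive_all[1]

# Corrected inference: bootstrap, simulating outcomes from the relationship model.
betas, ses = [], []
for _ in range(200):
    idx = rng.integers(0, len(valid), len(valid))
    y_sim = gamma[0] + gamma[1] * yp_valid[idx] + rng.normal(scale=sigma_rel, size=len(idx))
    b, se, _ = ols(x_valid[idx], y_sim)
    betas.append(b[1])
    ses.append(se[1])

beta_corrected = float(np.median(betas))
se_corrected = float(np.median(ses))

print(f"naive:     slope={b_naive[1]:.3f}  se={se_naive:.3f}")
print(f"corrected: slope={beta_corrected:.3f}  se={se_corrected:.3f}")
```

In this simulation the predictions are smoother than the observed outcomes, so the naive standard error understates the uncertainty; simulating outcomes from the relationship model restores the missing variability. Taking the median of the bootstrap standard errors is one possible summary; the standard deviation of the bootstrap slopes is another.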

Main Results:

The postpi method effectively corrects bias in statistical inference using predicted outcomes.
Demonstrated improvements in variance estimation and overall inference accuracy.
Validated the approach's broad applicability across different biomedical fields.

Conclusions:

Postprediction inference (postpi) offers a robust solution for accurate statistical analysis with machine learning-predicted outcomes.
The method enhances reliability in medical and public health research by addressing biases inherent in using predicted data.
An open-source R package is available for implementing the postpi approach.