Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Expected Value01:15

Expected Value

8.3K
The expected value is known as the "long-term" average or mean. This means that over the long term of experimenting over and over, you would expect this average. The expected average is represented by the symbol μ. It is calculated as follows:
8.3K
Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

9.2K
In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...
9.2K
Estimating Population Mean with Known Standard Deviation01:16

Estimating Population Mean with Known Standard Deviation

10.0K
To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...
10.0K
Margin of Error01:27

Margin of Error

8.1K
The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.
8.1K
What are Estimates?01:06

What are Estimates?

9.3K
It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates. 
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...
9.3K
Standard Error of the Mean01:13

Standard Error of the Mean

13.3K
The sampling variability of a statistic is defined as how much the statistic varies from one sample to another. The sampling variability of a statistic is typically measured by measuring its standard error.
The standard error of the mean is an example of a standard error. It is a unique standard deviation known as the standard deviation of the sampling distribution of the mean. The standard error of the mean is a statistic that calculates how correctly a sample distribution represents a...
13.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reply to Liu et al.: Beyond Frequency: Rethinking Exacerbation Risk in COPD.

American journal of respiratory and critical care medicine·2026
Same author

Performance of multivariable risk prediction algorithms in predicting COPD exacerbations: a population-based study.

Thorax·2026
Same author

Oral anticoagulant switching in patients with atrial fibrillation: a long-term cohort study using population-level administrative data.

International journal of clinical pharmacy·2026
Same author

Prediction of exposure to chrysotile asbestos fibers among Quebec miners and millers based on impinger measurements.

Annals of work exposures and health·2026
Same author

Life Expectancy in Chronic Obstructive Pulmonary Disease.

JAMA internal medicine·2026
Same author

Development of an asthma policy model for Canada: Lifetime Exposures and Asthma outcomes Projection.

PloS one·2026
Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026
Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026
Same journal

Subgroup Analysis of Interval-censored Failure Time Data With Application to Alzheimer's Disease.

Statistics in medicine·2026
Same journal

Rejoinder to Commentaries on "A Perspective on the Appropriate Implementation of ICH E9(R1) Addendum Strategies for Handling Intercurrent Events".

Statistics in medicine·2026
Same journal

A Multi-Stage Drop-the-Loser Design With Superiority Boundaries.

Statistics in medicine·2026
Same journal

Interpretable ROI Identification in Brain Image Analysis: Overcoming CNN Black Box Challenges With Kriging-Enhanced Adaptive Sampling.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: Apr 15, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.8K

Expected Value of Sample Information Calculations for Risk Prediction Model Development.

Abdollah Safari1, Paul Gustafson2, Mohsen Sadatsafavi3

  • 1School of Mathematics, Statistics, and Computer Science, Faculty of Science, University of Tehran, Tehran, Iran.

Statistics in Medicine
|April 13, 2026
PubMed
Summary
This summary is machine-generated.

Developing risk prediction models requires careful consideration of sample size. This study introduces the Expected Value of Sample Information (EVSI) to quantify the clinical utility gain from additional data, aiding study design.

Keywords:
Bayesian statisticsdecision theoryprecision medicinepredictive analyticsvalue of information

More Related Videos

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

8.3K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

11.1K

Related Experiment Videos

Last Updated: Apr 15, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.8K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

8.3K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

11.1K

Area of Science:

  • Biostatistics
  • Clinical Epidemiology
  • Decision Analysis

Background:

  • Risk prediction models are often presented as deterministic, but finite sample sizes introduce inherent uncertainty.
  • Traditional methods address performance metric uncertainty and prediction stability, but not clinical utility directly.
  • Statistical inference is less relevant for evaluating clinical utility using metrics like net benefit.

Purpose of the Study:

  • To define and quantify the Expected Value of Sample Information (EVSI) for risk prediction model development.
  • To evaluate the expected gain in clinical utility from acquiring additional development data.
  • To propose a decision-theoretic approach complementing classical inferential methods in study design.

Main Methods:

  • Defined EVSI as the expected gain in net benefit (NB) from an additional development sample.
  • Proposed a bootstrap-based algorithm for computing EVSI.
  • Demonstrated the feasibility and face validity of the EVSI computation algorithm in a case study.

Main Results:

  • The Expected Value of Sample Information (EVSI) was defined and computed using a novel bootstrap algorithm.
  • The study showed that procuring more development data is associated with an expected gain in model utility.
  • The proposed method demonstrated feasibility and face validity in a practical case study.

Conclusions:

  • Decision-theoretic metrics, such as EVSI, can effectively complement classical inferential methods for designing risk prediction model studies.
  • EVSI provides a framework to quantify the value of additional data in terms of clinical utility.
  • This approach aids in optimizing sample size and resource allocation for developing robust risk prediction models.