Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Prevalence and Incidence

Prevalence and Incidence

In statistical epidemiology and health sciences, two essential metrics—prevalence and incidence—are fundamental for understanding disease dynamics within a population. These measures enable public health officials, epidemiologists, and researchers to assess the burden of diseases, allocate resources effectively, and design impactful public health policies and interventions.
Prevalence indicates the proportion of individuals in a population who have a specific disease or health condition at a...

Distributions to Estimate Population Parameter

Distributions to Estimate Population Parameter

The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...

Steps in Outbreak Investigation

Steps in Outbreak Investigation

In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:

Choosing Between z and t Distribution

Choosing Between z and t Distribution

The z and the Student t distribution estimate the population mean using the sample mean and standard deviation. However, to decide which distribution to use for a calculation, one needs to determine the sample size, the nature of the distribution, and whether the population standard deviation is known. If the population standard deviation is known and the population is normally distributed, or if the sample size is greater than 30, the z distribution is preferred. The Student t distribution is...

Statistical Methods for Analyzing Epidemiological Data

Statistical Methods for Analyzing Epidemiological Data

Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:

Estimating Population Mean with Unknown Standard Deviation

Estimating Population Mean with Unknown Standard Deviation

In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the Guinness...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Two-Step Error-Controlling Classifiers With Application to Cost-Effective Disease Diagnosis.

Statistics in medicine·2026

Same author

Estimating controlled direct treatment effects on pain intensity using structural mean models.

Pain reports·2026

Same author

Factors affecting power in stepped wedge trials when the treatment effect varies with time.

Trials·2026

Same author

Joint modelling of competing risks and current status data: an application to a spontaneous labour study.

Journal of the Royal Statistical Society. Series C, Applied statistics·2025

Same author

Modeling the age-specific incidence of mild cognitive impairment incorporating the time-varying relationship of Alzheimer's disease biomarkers over 28 years.

Journal of Alzheimer's disease : JAD·2025

Same author

Weighted Brier Score - an Overall Summary Measure for Risk Prediction Models with Clinical Utility Consideration.

Statistics in biosciences·2025

Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026

Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026

Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026

Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026

Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026

Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 25, 2026

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

Estimating incident population distribution from prevalent data.

Kwun Chuen Gary Chan¹, Mei-Cheng Wang

¹Department of Biostatistics and Department of Health Services, University of Washington, Seattle, Washington 98195, USA. kcgchan@u.washington.edu

|February 9, 2012

Summary

This summary is machine-generated.

Prevalent sampling for disease studies is economical but introduces bias. This research develops methods to accurately estimate baseline variable distributions from biased prevalent data, improving survival analysis for incident disease populations.

More Related Videos

Trajectory Data Analyses for Pedestrian Space-time Activity Study

Trajectory Data Analyses for Pedestrian Space-time Activity Study

Published on: February 25, 2013

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Related Experiment Videos

Last Updated: May 25, 2026

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

Trajectory Data Analyses for Pedestrian Space-time Activity Study

Trajectory Data Analyses for Pedestrian Space-time Activity Study

Published on: February 25, 2013

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Area of Science:

Biostatistics
Epidemiology
Survival Analysis

Background:

Prevalent sampling is cost-effective for studying disease survival distributions.
However, prevalent samples are inherently biased, overrepresenting individuals with longer survival times.
This bias can distort estimates of baseline variable distributions, impacting study validity.

Purpose of the Study:

To develop methods for accurately estimating baseline variable distributions in incident disease populations using prevalent data.
To address the inherent biases associated with prevalent sampling schemes.
To provide reliable statistical tools for survival analysis in diseased populations.

Main Methods:

Development of nonparametric and semiparametric statistical methods.
Focus on estimating the distribution function of baseline variables.
Application to data collected via prevalent sampling.

Main Results:

The study introduces novel methods to correct for sampling bias in prevalent cohorts.
These methods enable more accurate estimation of population distribution functions for baseline variables.
Demonstrates the potential for serious bias when ignoring prevalent sampling effects.

Conclusions:

Accurate estimation of baseline variable distributions is crucial for valid survival analysis.
The developed nonparametric and semiparametric methods offer solutions for utilizing prevalent data effectively.
Researchers should acknowledge and correct for biases in prevalent sampling to ensure reliable findings.