Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Sample Size Calculation

Sample Size Calculation

Knowledge of the sample size is the first requirement to conduct random sampling or an experiment. The sample size is the total number of units, observations, or groups (in some cases) used to get the data to estimate a population parameter. As the name suggests, the sample size is that of the sample drawn from the population and differs from the population size.
The sample size for the given experiment or sampling effort is fundamental to any study design. Sample size decides the number of...

Sample Proportion and Population Proportion

Sample Proportion and Population Proportion

Collecting samples or responses from an entire population takes significant time and effort, so a researcher collects responses from only a sample of that population. Suppose a study needs to collect information about a specific mobile application. After sample collection, the researcher analyzes the data and discovers that most individuals in the sample use that specific mobile application. The sample proportion measures the number of individuals in a sample who either use or don't use the...

Sampling Plans

Sampling Plans

Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...

Censoring Survival Data

Censoring Survival Data

Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different reasons...

Statistical Methods for Analyzing Epidemiological Data

Statistical Methods for Analyzing Epidemiological Data

Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:

Analysis of Population Pharmacokinetic Data

Analysis of Population Pharmacokinetic Data

Analysis of population pharmacokinetic data involves studying the behavior of drugs within diverse populations to understand their pharmacokinetic parameters. Traditional pharmacokinetic methods typically involve collecting samples from a few individuals and estimating these parameters. While these methods are commonly used, they have limitations in capturing the variability in drug response among individuals or heterogeneous populations. Population pharmacokinetics is employed to address these...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Spatially Correlated Analysis of Infectious Disease Outcomes Based on Bayesian Functional Hierarchical Models.

Statistics in medicine·2026

Same author

Rank-based methods for assessing equivalence/non-inferiority with assay sensitivity in a three-arm trial with ordinal endpoints.

Statistical methods in medical research·2026

Same author

Functional varying-coefficient Cox model and its application.

Statistical methods in medical research·2026

Same author

The Improved EMS Algorithm for Latent Variable Selection in M3PL Model.

Applied psychological measurement·2024

Same author

Bayesian analysis of joint quantile regression for multi-response longitudinal data with application to primary biliary cirrhosis sequential cohort study.

Statistical methods in medical research·2024

Same author

Sample size determination for interval estimation of the prevalence of a sensitive attribute under non-randomized response models.

The British journal of mathematical and statistical psychology·2024

Same journal

Asymptotic online FWER control for dependent test statistics.

Statistical methods in medical research·2026

Same journal

Regression analysis of misclassified current status data with potentially unknown test accuracy.

Statistical methods in medical research·2026

Same journal

Bayesian multivariate linear mixed-effects models with varied association structures.

Statistical methods in medical research·2026

Same journal

Inference about the ratio of age-standardized rates between two overlapping populations.

Statistical methods in medical research·2026

Same journal

A robust neural network with random effects for subject-specific prediction of clustered count data.

Statistical methods in medical research·2026

Same journal

A comparison of methods for designing hybrid type 2 cluster-randomized trials with continuous effectiveness and implementation endpoints.

Statistical methods in medical research·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 24, 2026

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

Sample size determination for disease prevalence studies with partially validated data.

Shi-Fang Qiu¹, Wai-Yin Poon², Man-Lai Tang³

¹Department of Statistics, Chongqing University of Technology, China sfqiu@amss.ac.cn.

Statistical Methods in Medical Research

|March 1, 2012

Summary

This summary is machine-generated.

This study determines optimal sample sizes for disease prevalence research using partially validated data. It ensures accurate statistical power and confidence intervals, crucial for reliable epidemiological findings.

Keywords:

Asymptotic inference disease prevalence double-sampling partially validated data sample size

More Related Videos

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

Related Experiment Videos

Last Updated: May 24, 2026

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

Area of Science:

Epidemiology
Biostatistics
Medical Research Methodology

Background:

Disease prevalence studies are vital for medical research.
Accurate classification of subjects is essential but challenging due to costly gold-standard tests and fallible screening tests.
Partially validated datasets offer a compromise, using both screening and gold-standard tests on subsets of data.

Purpose of the Study:

To investigate methods for determining appropriate sample sizes in disease prevalence studies that utilize partially validated datasets.
To provide researchers with tools to ensure adequate statistical power and precise confidence intervals when dealing with imperfect data.
To enhance the reliability and efficiency of sample size calculations in epidemiological research.

Main Methods:

The study employs two primary approaches for sample size determination: achieving pre-specified statistical power at a given significance level, and controlling the width of a confidence interval at a specified confidence level.
Empirical studies were conducted to evaluate the performance of different testing procedures using the proposed sample size determination methods.
The practical utility of the developed methods was demonstrated through an analysis of a real-world dataset.

Main Results:

The research provides validated methods for calculating sample sizes tailored to partially validated data in disease prevalence studies.
The proposed methods effectively balance the trade-offs between testing costs and data accuracy.
Empirical evaluations confirmed the performance of the sample size determination techniques.

Conclusions:

The developed sample size determination methods are applicable to disease prevalence studies using partially validated data.
These methods offer a robust framework for planning studies, ensuring statistical rigor and efficient resource allocation.
The findings contribute to improving the design and execution of epidemiological research involving imperfect diagnostic tests.