Sample size calculation in three-level cluster randomized trials using generalized estimating equation models.

Jingxia Liu¹,², Graham A Colditz¹

¹Division of Public Health Sciences, Department of Surgery, Washington University School of Medicine (WUSM), St. Louis, Missouri, USA.

Statistics in Medicine

July 29, 2020

Summary

This summary is machine-generated.

This study extends generalized estimating equations (GEE) for three-level cluster randomized trials (CRTs) with nested data. It provides methods to accurately estimate treatment effects and accounts for unequal cluster sizes, improving implementation science research.

Keywords:

bias-corrected sandwich estimator; cluster randomized trial; generalized estimating equation; nested correlation structure; relative efficiency

Area of Science:

Implementation Science
Biostatistics
Health Services Research

Background:

Three-level cluster randomized trials (CRTs) are increasingly used in implementation science, generating complex nested data structures.
Existing methods, like generalized estimating equations (GEE) with a nested exchangeable correlation structure, address two-level clustering but require extension for three levels.
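For orientation, a nested exchangeable correlation structure in a three-level design can be written as below. The notation is an assumption made for illustration and may differ from the paper's: Y_{ijk} is the outcome of subject k seen by provider j in practice i, \alpha_1 is the correlation between subjects of the same provider, and \alpha_2 is the correlation between subjects of different providers in the same practice.

```latex
\operatorname{corr}\left(Y_{ijk},\, Y_{i'j'k'}\right) =
\begin{cases}
1        & \text{if } i = i',\ j = j',\ k = k' \\
\alpha_1 & \text{if } i = i',\ j = j',\ k \neq k' \\
\alpha_2 & \text{if } i = i',\ j \neq j' \\
0        & \text{if } i \neq i'
\end{cases}
```

Under this parameterization one would typically expect \alpha_1 \geq \alpha_2, since subjects sharing a provider are usually more alike than subjects who only share a practice.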

Purpose of the Study:

To extend GEE models for analyzing continuous, binary, or count data in three-level CRTs.
To derive and evaluate bias-corrected sandwich estimators for improved treatment effect estimation in three-level CRTs.
To assess the impact of unequal provider and practice sizes on statistical efficiency and propose adjustments.

Main Methods:

Utilized generalized estimating equations (GEE) with a nested exchangeable correlation structure for three-level CRTs.
Derived asymptotic variances for treatment effect estimators across different outcome types.
Extended two bias-corrected sandwich estimators to the three-level CRT context.
Conducted simulation studies to evaluate estimator performance under various provider and practice size distributions.
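As a minimal sketch of the working correlation structure named above, the following builds the nested exchangeable correlation matrix for a single practice. The function names and the parameters `rho_within` (same provider) and `rho_between` (different providers, same practice) are illustrative assumptions, not the paper's notation. The closed-form design effect shown is the standard equal-size result and is included only as a cross-check on the matrix.

```python
import numpy as np

def nested_exchangeable(provider_sizes, rho_within, rho_between):
    """Correlation matrix for one practice (hypothetical helper).

    provider_sizes: number of subjects per provider in this practice.
    rho_within:  correlation of two subjects of the same provider.
    rho_between: correlation of two subjects of different providers.
    """
    # Label each subject with the index of its provider.
    labels = np.repeat(np.arange(len(provider_sizes)), provider_sizes)
    same_provider = labels[:, None] == labels[None, :]
    R = np.where(same_provider, rho_within, rho_between).astype(float)
    np.fill_diagonal(R, 1.0)
    return R

def design_effect_equal(m, k, rho_within, rho_between):
    """Variance inflation for k providers with m subjects each."""
    return 1 + (m - 1) * rho_within + m * (k - 1) * rho_between

# Cross-check: the variance of a practice mean is sum(R) / n^2,
# which should equal design_effect / n when sizes are equal.
R = nested_exchangeable([4, 4, 4], rho_within=0.10, rho_between=0.05)
n = R.shape[0]
print(R.sum() / n, design_effect_equal(4, 3, 0.10, 0.05))
```

The same helper accepts unequal provider sizes, for example `nested_exchangeable([2, 3], 0.10, 0.05)`, which is the situation the simulation studies vary.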

Main Results:

Provided formulas for asymptotic variances and bias-corrected sandwich estimators in three-level CRTs.
Quantified the relative efficiency (RE) loss due to unequal provider and practice sizes.
Demonstrated the performance of the proposed methods across different size distribution scenarios through simulations.
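To make the idea of relative efficiency concrete, here is a simplified sketch and not the paper's formula. It assumes a continuous outcome, an identity link, a practice-level treatment, and a correctly specified nested exchangeable working correlation, so that each practice contributes information proportional to 1ᵀR⁻¹1. RE is then approximated as the average information under unequal sizes divided by the information under equal sizes. All function and parameter names are hypothetical.

```python
import numpy as np

def nested_exchangeable(provider_sizes, rho_within, rho_between):
    """Nested exchangeable correlation matrix for one practice."""
    labels = np.repeat(np.arange(len(provider_sizes)), provider_sizes)
    same_provider = labels[:, None] == labels[None, :]
    R = np.where(same_provider, rho_within, rho_between).astype(float)
    np.fill_diagonal(R, 1.0)
    return R

def practice_information(provider_sizes, rho_within, rho_between):
    """Information contributed by one practice, up to a constant: 1' R^{-1} 1."""
    R = nested_exchangeable(provider_sizes, rho_within, rho_between)
    ones = np.ones(R.shape[0])
    return float(ones @ np.linalg.solve(R, ones))

def relative_efficiency(unequal_practices, equal_practice,
                        rho_within, rho_between):
    """Approximate RE of an unequal-size design versus an equal-size one."""
    info_unequal = np.mean([
        practice_information(p, rho_within, rho_between)
        for p in unequal_practices
    ])
    info_equal = practice_information(equal_practice, rho_within, rho_between)
    return info_unequal / info_equal

# Four practices, each with 3 providers and 12 subjects in total,
# compared with the balanced layout of 4 subjects per provider.
practices = [[2, 4, 6], [4, 4, 4], [1, 3, 8], [3, 4, 5]]
re_value = relative_efficiency(practices, [4, 4, 4], 0.10, 0.05)
print(round(re_value, 3))
```

In this setup the value falls below 1 whenever provider sizes vary around the same mean, which is the efficiency loss the results quantify.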

Conclusions:

The proposed GEE-based methods and bias-corrected estimators enhance the analysis of three-level CRTs.
Understanding and accounting for unequal cluster sizes is crucial for accurate treatment effect estimation and efficient study design.
The authors also present a method for increasing the number of practices to compensate for the efficiency loss caused by unequal sizes.
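One common way to compensate for an efficiency loss is to divide the number of practices required under equal sizes by the relative efficiency and round up. The sketch below illustrates that generic inflation rule. It is an assumption for illustration and may differ from the adjustment the paper derives; the function name and inputs are hypothetical.

```python
import math

def inflate_practices(n_equal, relative_efficiency):
    """Practices needed after adjusting for unequal sizes (generic rule).

    n_equal: practices required if all sizes were equal.
    relative_efficiency: RE of the unequal design, in (0, 1].
    """
    if not 0 < relative_efficiency <= 1:
        raise ValueError("relative_efficiency must lie in (0, 1]")
    return math.ceil(n_equal / relative_efficiency)

# 20 practices under equal sizes and RE = 0.9 gives 23 practices.
print(inflate_practices(20, 0.9))
```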