Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Errors In Hypothesis Tests

Errors In Hypothesis Tests

When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Assumptions of Survival Analysis

Assumptions of Survival Analysis

Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Statistical Hypothesis Testing

Statistical Hypothesis Testing

Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Equity in law enforcement actions following a school threat assessment.

Law and human behavior·2025

Same author

The Impact of Restorative Practices on the Use of Out-of-School Suspensions: Results from a Cluster Randomized Controlled Trial.

Prevention science : the official journal of the Society for Prevention Research·2023

Same author

Accounting for Heteroskedasticity Resulting from Between-Group Differences in Multilevel Models.

Multivariate behavioral research·2022

Same author

Using cluster-robust standard errors when analyzing group-randomized trials with few clusters.

Behavior research methods·2021

Same author

Alternatives to Logistic Regression Models when Analyzing Cluster Randomized Trials with Binary Outcomes.

Prevention science : the official journal of the Society for Prevention Research·2021

Same author

An investigation of the psychometric properties of the early identification system-student report in a middle school sample.

School psychology (Washington, D.C.)·2020

Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026

Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026

Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026

Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026

Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026

Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 8, 2026

Infinium Assay for Large-scale SNP Genotyping Applications

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

When Cluster-Robust Inferences Fail.

Francis Huang^1,2

¹University of Missouri, Columbia, USA.

Educational and Psychological Measurement

|December 22, 2025

Summary

This summary is machine-generated.

Cluster-robust standard errors (CRSEs) can fail in nested data, especially with imbalanced clusters. Alternative estimators (CR2, CR3) and df adjustments maintain Type I error rates, with CR1 and effective cluster size df also being acceptable.

Keywords:

cluster robust standard errors clustered data degrees of freedom effective sample size

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

Related Experiment Videos

Last Updated: Jan 8, 2026

Infinium Assay for Large-scale SNP Genotyping Applications

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

Area of Science:

Statistics
Educational Research
Data Analysis

Background:

Cluster-robust standard errors (CRSEs) are widely used for nested data but can fail to maintain Type I error rates.
Issues arise particularly with imbalanced cluster sizes, common in educational datasets.
Accurate statistical inference is crucial when using cluster-level predictors.

Purpose of the Study:

To investigate conditions where CRSEs fail to maintain Type I error rates.
To evaluate alternative estimators and degrees of freedom (df) adjustments.
To assess the performance of different CRSE methods with continuous and dichotomous predictors.

Main Methods:

A Monte Carlo simulation was employed to test various scenarios.
Evaluated the traditional CRSE (CR1) estimator.
Assessed bias-reduced linearization (CR2) and jackknife (CR3) estimators with df adjustments.

Main Results:

CR2 and CR3 estimators with df adjustments were generally effective in maintaining Type I error rates.
The traditional CR1 estimator paired with df based on effective cluster size was also acceptable.
Performance varied depending on specific data characteristics and predictor types.

Conclusions:

Alternative CRSE estimators and df adjustments can effectively address Type I error rate issues in nested data.
Careful consideration of dataset characteristics, such as cluster size balance, is essential for reliable statistical inference.
Accurate reporting of nested data structures is vital for the appropriate application of CRSEs.