Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Errors In Hypothesis Tests01:14

Errors In Hypothesis Tests

5.8K
When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.
5.8K
Cluster Sampling Method01:20

Cluster Sampling Method

13.9K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
13.9K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

425
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
425
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

382
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
382
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.4K
Statistical Hypothesis Testing01:16

Statistical Hypothesis Testing

6.1K
Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...
6.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Equity in law enforcement actions following a school threat assessment.

Law and human behavior·2025
Same author

The Impact of Restorative Practices on the Use of Out-of-School Suspensions: Results from a Cluster Randomized Controlled Trial.

Prevention science : the official journal of the Society for Prevention Research·2023
Same author

Accounting for Heteroskedasticity Resulting from Between-Group Differences in Multilevel Models.

Multivariate behavioral research·2022
Same author

Using cluster-robust standard errors when analyzing group-randomized trials with few clusters.

Behavior research methods·2021
Same author

Alternatives to Logistic Regression Models when Analyzing Cluster Randomized Trials with Binary Outcomes.

Prevention science : the official journal of the Society for Prevention Research·2021
Same author

An investigation of the psychometric properties of the early identification system-student report in a middle school sample.

School psychology (Washington, D.C.)·2020
Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026
Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026
Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026
Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026
Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026
Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026
See all related articles

Related Experiment Video

Updated: Jan 8, 2026

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

39.8K

When Cluster-Robust Inferences Fail.

Francis Huang1,2

  • 1University of Missouri, Columbia, USA.

Educational and Psychological Measurement
|December 22, 2025
PubMed
Summary
This summary is machine-generated.

Cluster-robust standard errors (CRSEs) can fail in nested data, especially with imbalanced clusters. Alternative estimators (CR2, CR3) and df adjustments maintain Type I error rates, with CR1 and effective cluster size df also being acceptable.

Keywords:
cluster robust standard errorsclustered datadegrees of freedomeffective sample size

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.3K
Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

313

Related Experiment Videos

Last Updated: Jan 8, 2026

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

39.8K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.3K
Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

313

Area of Science:

  • Statistics
  • Educational Research
  • Data Analysis

Background:

  • Cluster-robust standard errors (CRSEs) are widely used for nested data but can fail to maintain Type I error rates.
  • Issues arise particularly with imbalanced cluster sizes, common in educational datasets.
  • Accurate statistical inference is crucial when using cluster-level predictors.

Purpose of the Study:

  • To investigate conditions where CRSEs fail to maintain Type I error rates.
  • To evaluate alternative estimators and degrees of freedom (df) adjustments.
  • To assess the performance of different CRSE methods with continuous and dichotomous predictors.

Main Methods:

  • A Monte Carlo simulation was employed to test various scenarios.
  • Evaluated the traditional CRSE (CR1) estimator.
  • Assessed bias-reduced linearization (CR2) and jackknife (CR3) estimators with df adjustments.

Main Results:

  • CR2 and CR3 estimators with df adjustments were generally effective in maintaining Type I error rates.
  • The traditional CR1 estimator paired with df based on effective cluster size was also acceptable.
  • Performance varied depending on specific data characteristics and predictor types.

Conclusions:

  • Alternative CRSE estimators and df adjustments can effectively address Type I error rate issues in nested data.
  • Careful consideration of dataset characteristics, such as cluster size balance, is essential for reliable statistical inference.
  • Accurate reporting of nested data structures is vital for the appropriate application of CRSEs.