


Adjusted Residuals for Evaluating Conditional Independence in IRT Models for Multistage Adaptive Testing.

Psychometrika·2026



Peter W van Rijn¹, Usama S Ali²,³, Hyo Jeong Shin⁴

¹ETS Global, Amsterdam, The Netherlands. pvanrijn@etsglobal.org.

November 6, 2023

Summary

This summary is machine-generated.

Because of routing, data from multistage adaptive testing (MST) can violate the conditional independence assumption of item response theory (IRT) models. Adjusted residuals are therefore needed for valid statistical inference in MST, as illustrated by an analysis of PISA data.

Keywords:

conditional independence; item response theory; multistage adaptive testing; residual analysis


Area of Science:

Statistics
Psychometrics
Educational Measurement

Background:

Item response theory (IRT) models assume conditional independence of item responses given latent ability.
Multistage adaptive testing (MST) designs involve routing decisions that can violate this core IRT assumption.
This violation introduces dependencies in the data, impacting statistical inference.
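
For reference, the conditional independence assumption is commonly written as follows. The notation is assumed here rather than taken from the article: X_i is the response to item i, θ is the latent ability, and n is the number of items.

```latex
P(X_1 = x_1, \dots, X_n = x_n \mid \theta) = \prod_{i=1}^{n} P(X_i = x_i \mid \theta)
```

In words, once ability is held fixed, knowing the response to one item carries no further information about the response to any other item.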

Purpose of the Study:

To investigate the impact of routing in MST on the conditional independence assumption of IRT models.
To evaluate the appropriateness of generalized residuals for analyzing MST data.
To propose and validate adjustments to statistical methods for MST data.

Main Methods:

Examined the relationship between MST routing and data patterns using concepts from log-linear models.
Assessed the suitability of generalized residuals for item pair frequencies in IRT.
Developed and tested adjusted residuals tailored to specific MST designs through simulation and real data analysis.
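
The idea of a residual for item pair frequencies can be illustrated with a minimal sketch. This is not the authors' generalized or adjusted residual. It assumes a two-parameter logistic (2PL) model, standard normal ability, known (not estimated) item parameters, and a hypothetical routing rule; all parameter values and function names are invented for illustration. It compares the observed proportion of examinees who answer both items correctly with the proportion expected under the model, once in the full sample and once in the subsample routed on a stage-1 score.

```python
import numpy as np

def p_correct(theta, a, b):
    """Two-parameter logistic (2PL) probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def expected_pair_prob(item_i, item_j, n_nodes=61):
    """P(both items correct) for theta ~ N(0, 1), by Gauss-Hermite quadrature."""
    nodes, weights = np.polynomial.hermite_e.hermegauss(n_nodes)
    weights = weights / weights.sum()  # rescale to standard normal probabilities
    return float(np.sum(weights * p_correct(nodes, *item_i) * p_correct(nodes, *item_j)))

def pair_residual(x_i, x_j, expected):
    """Standardized gap between observed and expected joint-correct proportion.

    Simplification: the standard error ignores parameter estimation.
    """
    n = x_i.size
    observed = np.mean(x_i & x_j)
    return (observed - expected) / np.sqrt(expected * (1.0 - expected) / n)

rng = np.random.default_rng(0)
n_persons = 20000
theta = rng.standard_normal(n_persons)

def simulate(item):
    """Draw conditionally independent 0/1 responses given theta."""
    return rng.random(n_persons) < p_correct(theta, *item)

# Hypothetical (a, b) item parameters, chosen only for this illustration.
routing_items = [(1.0, 0.0)] * 3
item_i, item_j = (1.0, 0.5), (1.2, 0.8)

# Hypothetical routing rule: at least 2 of 3 routing items correct.
routing_score = sum(simulate(item).astype(int) for item in routing_items)
routed = routing_score >= 2

x_i, x_j = simulate(item_i), simulate(item_j)
expected = expected_pair_prob(item_i, item_j)

z_full = pair_residual(x_i, x_j, expected)                      # all examinees
z_routed = pair_residual(x_i[routed], x_j[routed], expected)    # routed subsample only
print(f"full sample: {z_full:.2f}, routed subsample: {z_routed:.2f}")
```

In this sketch the responses are generated under conditional independence, so the full-sample residual should behave roughly like a standard normal draw. The routed subsample has a shifted ability distribution, so the same unadjusted expectation no longer applies and the residual becomes large, which is one simple way to see why a residual may need to account for the routing design.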

Main Results:

Standard generalized residuals are inappropriate for MST data without modifications.
Adjustments to residuals are necessary and depend on the complexity of the MST routing.
The adjusted residuals demonstrated satisfactory Type I error rates in simulations and real data applications.
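
How a Type I error rate is typically estimated by simulation can be sketched as follows. The example uses a simple item-pair residual with known 2PL parameters and standard normal ability, not the adjusted residuals from the article; the item parameters, sample sizes, and function names are assumptions made for illustration. Data are generated repeatedly from a model in which conditional independence holds, and the share of replications in which the residual exceeds the two-sided 5% critical value is recorded.

```python
import numpy as np

def p_correct(theta, a, b):
    """Two-parameter logistic (2PL) probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def type_i_error_rate(n_reps=500, n_persons=2000, crit=1.96, seed=1):
    """Share of null replications in which |residual| exceeds the critical value."""
    rng = np.random.default_rng(seed)
    a_i, b_i, a_j, b_j = 1.0, -0.3, 1.3, 0.4  # hypothetical item parameters

    # Expected joint-correct probability for theta ~ N(0, 1).
    nodes, weights = np.polynomial.hermite_e.hermegauss(61)
    weights = weights / weights.sum()
    expected = np.sum(weights * p_correct(nodes, a_i, b_i) * p_correct(nodes, a_j, b_j))
    se = np.sqrt(expected * (1.0 - expected) / n_persons)

    rejections = 0
    for _ in range(n_reps):
        theta = rng.standard_normal(n_persons)
        x_i = rng.random(n_persons) < p_correct(theta, a_i, b_i)
        x_j = rng.random(n_persons) < p_correct(theta, a_j, b_j)
        z = (np.mean(x_i & x_j) - expected) / se
        rejections += int(abs(z) > crit)
    return rejections / n_reps

rate = type_i_error_rate()
print(f"empirical Type I error rate: {rate:.3f}")
```

A rate close to the nominal 0.05 indicates that the statistic is well calibrated under the null model; rates well above it would signal a residual that rejects too often.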

Conclusions:

The conditional independence assumption in IRT is challenged by MST routing.
Adjusted residuals are crucial for valid statistical inference in MST.
Findings have implications for interpreting results from large-scale assessments like PISA.