Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Introduction to Test of Independence01:21

Introduction to Test of Independence

2.3K
In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:
2.3K
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

136
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
136
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

207
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
207
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

200
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
200
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

2.5K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
2.5K
Hypothesis Test for Test of Independence01:16

Hypothesis Test for Test of Independence

3.6K
The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...
3.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adjusted Residuals for Evaluating Conditional Independence in IRT Models for Multistage Adaptive Testing.

Psychometrika·2026
Same author

The Nature and Measure of Critical Thinking: The PACIER Framework and Assessment.

Journal of Intelligence·2025
Same author

An Extended Two-Parameter Logistic Item Response Model to Handle Continuous Responses and Sparse Polytomous Responses.

Psychometrika·2025
Same author

Recent Advances in Alginate Lyase Engineering for Efficient Conversion of Alginate to Value-Added Products.

Microbial biotechnology·2025
Same author

Thermophilic aerobic digestion using aquaculture sludge from rainbow trout aquaculture facilities: effect of salinity.

Frontiers in microbiology·2024
Same author

Direct Itaconate Production from Brown Macroalgae Using Engineered <i>Vibrio</i> sp. dhg.

Journal of agricultural and food chemistry·2024
Same journal

Testing linear hypotheses in repeated measures generalized linear models using external information.

Psychometrika·2026
Same journal

When Do Unifactorial Items Increase the Reliability?

Psychometrika·2026
Same journal

Longitudinal Designs for Diagnostic Models: Identification and Estimation.

Psychometrika·2026
Same journal

Modeling Rare Events and Nonmonotone Nonignorable Missingness of Time-Varying Outcomes and Predictors in Binary Time-Series Daily Diary Data: A Bayesian Selection Model.

Psychometrika·2026
Same journal

Revelle's Beta: The Wait Is Over-Computation Becomes Possible.

Psychometrika·2026
Same journal

On dimensional implication graphs.

Psychometrika·2026
See all related articles

Related Experiment Video

Updated: Jul 11, 2025

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.2K

Adjusted Residuals for Evaluating Conditional Independence in IRT Models for Multistage Adaptive Testing.

Peter W van Rijn1, Usama S Ali2,3, Hyo Jeong Shin4

  • 1ETS Global, Amsterdam, The Netherlands. pvanrijn@etsglobal.org.

Psychometrika
|November 6, 2023
PubMed
Summary
This summary is machine-generated.

Multistage adaptive testing (MST) data violate item response theory (IRT) assumptions due to routing. Adjusted residuals are needed for accurate statistical inference in MST, as shown by PISA data analysis.

Keywords:
conditional independenceitem response theorymultistage adaptive testingresidual analysis

More Related Videos

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K
The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

5.9K

Related Experiment Videos

Last Updated: Jul 11, 2025

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.2K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K
The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

5.9K

Area of Science:

  • Statistics
  • Psychometrics
  • Educational Measurement

Background:

  • Item response theory (IRT) models assume conditional independence of item responses given latent ability.
  • Multistage adaptive testing (MST) designs involve routing decisions that can violate this core IRT assumption.
  • This violation introduces dependencies in the data, impacting statistical inference.

Purpose of the Study:

  • To investigate the impact of routing in MST on the conditional independence assumption of IRT models.
  • To evaluate the appropriateness of generalized residuals for analyzing MST data.
  • To propose and validate adjustments to statistical methods for MST data.

Main Methods:

  • Examined the relationship between MST routing and data patterns using concepts from log-linear models.
  • Assessed the suitability of generalized residuals for item pair frequencies in IRT.
  • Developed and tested adjusted residuals tailored to specific MST designs through simulation and real data analysis.

Main Results:

  • Standard generalized residuals are inappropriate for MST data without modifications.
  • Adjustments to residuals are necessary and depend on the complexity of the MST routing.
  • The adjusted residuals demonstrated satisfactory Type I error rates in simulations and real data applications.

Conclusions:

  • The conditional independence assumption in IRT is challenged by MST routing.
  • Adjusted residuals are crucial for valid statistical inference in MST.
  • Findings have implications for interpreting results from large-scale assessments like PISA.