Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Jun 16, 2026

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Xinyu Liu1, David J Weiss1

  • 1University of Minnesota, Minneapolis, USA.

Educational and Psychological Measurement
|June 15, 2026
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Constructing a binary prediction model with incomplete data: Variable selection to balance fairness and precision.

Psychological methods·2025
Same author

Using Machine Learning to Identify Social Determinants of Health that Impact Discharge Disposition for Hospitalized Patients.

Journal of the American Medical Directors Association·2025
Same author

Adaptive Measurement of Change in the Context of Item Parameter Drift.

Applied psychological measurement·2025
Same author

Termination Criteria for Grid Multiclassification Adaptive Testing With Multidimensional Polytomous Items.

Applied psychological measurement·2022
Same author

Robustness of Adaptive Measurement of Change to Item Parameter Estimation Error.

Educational and psychological measurement·2022
Same author

Multidimensional Computerized Adaptive Testing: A Potential Path Toward the Efficient and Precise Assessment of Applied Cognition, Daily Activity, and Mobility for Hospitalized Patients.

Archives of physical medicine and rehabilitation·2022
Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026
Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026
Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026
Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026
Same journal

The Anonymous Collection of Longitudinal Data: An Evaluation of Self-Generated Identification Codes and Methodological Challenges.

Educational and psychological measurement·2026
Same journal

Beyond One-Size-Fits-All: A Differential Sensitivity Framework for Machine Learning-Based Detection of Anomalous Survey Responses.

Educational and psychological measurement·2026
See all related articles

Choosing the right estimator and stopping rule is crucial for efficient computerized adaptive testing (CAT). Weighted Likelihood Estimation (WLE) with standard error of measurement (SEM) or fixed-length rules works best for high-quality item banks, while other rules are better for lower-quality ones.

Area of Science:

  • Psychometrics
  • Educational Measurement
  • Computerized Adaptive Testing (CAT)

Background:

  • Computerized adaptive testing (CAT) optimizes measurement by tailoring items to examinees.
  • The effectiveness of CAT relies on ability estimators and termination criteria.
  • Limited research exists on the interaction of these components across diverse item banks.

Purpose of the Study:

  • To investigate the interactive effects of ability estimators and termination criteria on CAT performance.
  • To evaluate these interactions across varying item bank sizes and information distributions.
  • To identify optimal CAT configurations for different item bank characteristics.

Main Methods:

  • Simulation study evaluating four ability estimators (MLE, WLE, MAP, EAP).
Keywords:
ability estimationcomputerized adaptive testingitem bank characteristicsitem response theorymeasurement precisionstopping rules

More Related Videos

A Computerized Functional Skills Assessment and Training Program Targeting Technology Based Everyday Functional Skills
07:31

A Computerized Functional Skills Assessment and Training Program Targeting Technology Based Everyday Functional Skills

Published on: February 13, 2020

Related Experiment Videos

Last Updated: Jun 16, 2026

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

A Computerized Functional Skills Assessment and Training Program Targeting Technology Based Everyday Functional Skills
07:31

A Computerized Functional Skills Assessment and Training Program Targeting Technology Based Everyday Functional Skills

Published on: February 13, 2020

  • Assessed four termination criteria (fixed-length, SEM, MI, Δθ).
  • Tested across low- (100-item) and high- (500-item) information banks with flat and peaked distributions using the three-parameter logistic model.
  • Main Results:

    • Optimal CAT configuration depends on item bank size and shape.
    • WLE proved the most robust estimator, mitigating MLE boundary issues and Bayesian shrinkage bias.
    • In high-information banks, SEM and fixed-length rules minimized bias and RMSE.
    • In low-information peaked banks, the Δθ rule with WLE balanced accuracy and efficiency, avoiding inefficient test elongation.

    Conclusions:

    • No single CAT design fits all scenarios; optimization is context-dependent.
    • For high-quality item banks, WLE with SEM or fixed-length rules is recommended.
    • For lower-quality banks, Δθ or hybrid SEM rules are advised to prevent inefficient test length.