Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reliability and Validity01:29

Reliability and Validity

Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
Accuracy and Errors in Hypothesis Testing01:13

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5% chance...
Errors In Hypothesis Tests01:14

Errors In Hypothesis Tests

When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.
Hindsight Biases01:12

Hindsight Biases

Hindsight bias leads you to believe that the event you just experienced was predictable, even though it really wasn’t. In other words, you knew all along that things would turn out the way they did. Can you relate this to the phrase "Hindsight is 20/20" now?
Decision Making: Traditional Method01:14

Decision Making: Traditional Method

The process of hypothesis testing based on the traditional method includes calculating the critical value, testing the value of the test statistic using the sample data, and interpreting these values.
First, a specific claim about the population parameter is decided based on the research question and is stated in a simple form. Further, an opposing statement to this claim is also stated. These statements can act as null and alternative hypotheses, out of which a null hypothesis would be a...
Testing a Claim about Population Proportion01:24

Testing a Claim about Population Proportion

A complete procedure for testing a claim about a population proportion is provided here.
There are two methods of testing a claim about a population proportion: (1) Using the sample proportion from the data where a binomial distribution is approximated to the normal distribution and (2) Using the binomial probabilities calculated from the data.
The first method uses normal distribution as an approximation to the binomial distribution. The requirements are as follows: sample size is large...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dimensionality Assessment in Forced-Choice Questionnaires: First Steps Toward an Exploratory Framework.

Educational and psychological measurement·2025
Same author

A Systematic Evaluation of Wording Effects Modeling Under the Exploratory Structural Equation Modeling Framework.

Multivariate behavioral research·2025
Same author

The Impact of Social Media Disorder, Family Functioning, and Community Social Disorder on Adolescents' Psychological Distress: The Mediating Role of Intolerance to Uncertainty.

Children (Basel, Switzerland)·2025
Same author

Revisiting the structure of Diagnostic and Statistical Manual of Mental Disorders, fifth edition, Section II personality disorder criteria using individual participant data meta-analysis.

Personality disorders·2025
Same author

Untangling the role of emotion regulation in gambling and video gaming cravings: A replication and extension study.

Addictive behaviors·2025
Same author

Unveiling the association between chronotype and emotional eating in Spanish adolescents: The EHDLA study.

Appetite·2025
Same journal

Compassionate Leadership: Development and Cross-Cultural Validation of Compassion at Work-Leadership Behaviors Inventory (CAW-LBI).

The Spanish journal of psychology·2026
Same journal

Climber Ability and Differences in Psychological, Physiological and Behavioral Responses to an On-sight Lead Climb.

The Spanish journal of psychology·2026
Same journal

The Youth Physical Activity Promotion Model in Spain and Chile: Comparison of Psychological and Social Variables.

The Spanish journal of psychology·2026
Same journal

Tell me Why: The Attributional Styles at Work Questionnaire and its Relationship with Affectivity, Personality, and Motivation.

The Spanish journal of psychology·2026
Same journal

The Indirect Relationship between Prosociality in the Workplace and Employee Well-Being: Testing Multiple Mediators.

The Spanish journal of psychology·2026
Same journal

ICT Use at Work as a Double-Edged Sword: A Moderated Mediation Model of Employee Well-Being.

The Spanish journal of psychology·2026
See all related articles

Related Experiment Video

Updated: May 24, 2026

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Computerized adaptive testing: the capitalization on chance problem.

Julio Olea1, Juan Ramón Barrada, Francisco J Abad

  • 1Facultad de Psicología, Universidad Autónoma de Madrid, 28049-Madrid, Spain. julio.olea@uam.es

The Spanish Journal of Psychology
|March 3, 2012
PubMed
Summary
This summary is machine-generated.

Capitalization on chance significantly impacts item selection and ability estimation in Computerized Adaptive Testing (CAT). This bias is pronounced with smaller calibration samples and larger item bank to test length ratios, affecting precision estimates.

More Related Videos

Advancing Dyslexia Assessment in Children Through Computerized Testing
09:00

Advancing Dyslexia Assessment in Children Through Computerized Testing

Published on: August 16, 2024

A Tactile Automated Passive-Finger Stimulator (TAPS)
19:44

A Tactile Automated Passive-Finger Stimulator (TAPS)

Published on: June 3, 2009

Related Experiment Videos

Last Updated: May 24, 2026

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Advancing Dyslexia Assessment in Children Through Computerized Testing
09:00

Advancing Dyslexia Assessment in Children Through Computerized Testing

Published on: August 16, 2024

A Tactile Automated Passive-Finger Stimulator (TAPS)
19:44

A Tactile Automated Passive-Finger Stimulator (TAPS)

Published on: June 3, 2009

Area of Science:

  • Psychometrics
  • Educational Measurement
  • Computerized Adaptive Testing (CAT)

Background:

  • Computerized Adaptive Testing (CAT) utilizes item response theory (IRT) models for efficient ability estimation.
  • Item selection algorithms in CAT can be susceptible to 'capitalization on chance,' where item parameters are estimated with bias due to sampling variability.
  • The 3-parameter logistic (3PL) model is commonly employed in CAT, but its accuracy depends on robust item parameter estimation.

Purpose of the Study:

  • To investigate the effects of capitalization on chance in item selection and ability estimation within CAT.
  • To examine how calibration sample size and item bank to test length ratio influence estimation errors in CAT.
  • To evaluate the performance of CAT compared to random testing under varying conditions.

Main Methods:

  • Simulation studies were conducted using the 3-parameter logistic (3PL) model.
  • Manipulation of calibration sample sizes (N = 500, 1000, 2000) and item bank size to test length ratios (197/20, 197/40, 788/20, 788/40).
  • Comparison of item selection and ability estimation in CAT versus random test administration.

Main Results:

  • Capitalization on chance was found to be a significant issue in CAT, especially under small calibration sample conditions, leading to large positive bias.
  • Overestimation of precision (asymptotic Standard Error) reached up to 40% for broad ranges of theta in CAT, unlike Root Mean Square Error (RMSE).
  • The problem of capitalization on chance intensified with increasing item bank size to test length ratios.

Conclusions:

  • Capitalization on chance poses a serious threat to the accuracy of ability estimation in CAT, particularly with limited calibration data.
  • The choice of item selection algorithm and exposure control methods are critical for mitigating bias in CAT.
  • Further research into effective exposure control strategies is warranted to improve the reliability of CAT systems.