Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reliability and Validity

Reliability and Validity

Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.

Accuracy and Errors in Hypothesis Testing

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5% chance...

Errors In Hypothesis Tests

Errors In Hypothesis Tests

When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.

Hindsight Biases

Hindsight Biases

Hindsight bias leads you to believe that the event you just experienced was predictable, even though it really wasn’t. In other words, you knew all along that things would turn out the way they did. Can you relate this to the phrase "Hindsight is 20/20" now?

Decision Making: Traditional Method

Decision Making: Traditional Method

The process of hypothesis testing based on the traditional method includes calculating the critical value, testing the value of the test statistic using the sample data, and interpreting these values.
First, a specific claim about the population parameter is decided based on the research question and is stated in a simple form. Further, an opposing statement to this claim is also stated. These statements can act as null and alternative hypotheses, out of which a null hypothesis would be a...

Testing a Claim about Population Proportion

Testing a Claim about Population Proportion

A complete procedure for testing a claim about a population proportion is provided here.
There are two methods of testing a claim about a population proportion: (1) Using the sample proportion from the data where a binomial distribution is approximated to the normal distribution and (2) Using the binomial probabilities calculated from the data.
The first method uses normal distribution as an approximation to the binomial distribution. The requirements are as follows: sample size is large...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Dimensionality Assessment in Forced-Choice Questionnaires: First Steps Toward an Exploratory Framework.

Educational and psychological measurement·2025

Same author

A Systematic Evaluation of Wording Effects Modeling Under the Exploratory Structural Equation Modeling Framework.

Multivariate behavioral research·2025

Same author

The Impact of Social Media Disorder, Family Functioning, and Community Social Disorder on Adolescents' Psychological Distress: The Mediating Role of Intolerance to Uncertainty.

Children (Basel, Switzerland)·2025

Same author

Revisiting the structure of Diagnostic and Statistical Manual of Mental Disorders, fifth edition, Section II personality disorder criteria using individual participant data meta-analysis.

Personality disorders·2025

Same author

Untangling the role of emotion regulation in gambling and video gaming cravings: A replication and extension study.

Addictive behaviors·2025

Same author

Unveiling the association between chronotype and emotional eating in Spanish adolescents: The EHDLA study.

Appetite·2025

Same journal

Compassionate Leadership: Development and Cross-Cultural Validation of Compassion at Work-Leadership Behaviors Inventory (CAW-LBI).

The Spanish journal of psychology·2026

Same journal

Climber Ability and Differences in Psychological, Physiological and Behavioral Responses to an On-sight Lead Climb.

The Spanish journal of psychology·2026

Same journal

The Youth Physical Activity Promotion Model in Spain and Chile: Comparison of Psychological and Social Variables.

The Spanish journal of psychology·2026

Same journal

Tell me Why: The Attributional Styles at Work Questionnaire and its Relationship with Affectivity, Personality, and Motivation.

The Spanish journal of psychology·2026

Same journal

The Indirect Relationship between Prosociality in the Workplace and Employee Well-Being: Testing Multiple Mediators.

The Spanish journal of psychology·2026

Same journal

ICT Use at Work as a Double-Edged Sword: A Moderated Mediation Model of Employee Well-Being.

The Spanish journal of psychology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 24, 2026

Computerized Adaptive Testing System of Functional Assessment of Stroke

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Computerized adaptive testing: the capitalization on chance problem.

Julio Olea¹, Juan Ramón Barrada, Francisco J Abad

¹Facultad de Psicología, Universidad Autónoma de Madrid, 28049-Madrid, Spain. julio.olea@uam.es

The Spanish Journal of Psychology

|March 3, 2012

Summary

This summary is machine-generated.

Capitalization on chance significantly impacts item selection and ability estimation in Computerized Adaptive Testing (CAT). This bias is pronounced with smaller calibration samples and larger item bank to test length ratios, affecting precision estimates.

More Related Videos

Advancing Dyslexia Assessment in Children Through Computerized Testing

Advancing Dyslexia Assessment in Children Through Computerized Testing

Published on: August 16, 2024

A Tactile Automated Passive-Finger Stimulator (TAPS)

A Tactile Automated Passive-Finger Stimulator (TAPS)

Published on: June 3, 2009

Related Experiment Videos

Last Updated: May 24, 2026

Computerized Adaptive Testing System of Functional Assessment of Stroke

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Advancing Dyslexia Assessment in Children Through Computerized Testing

Advancing Dyslexia Assessment in Children Through Computerized Testing

Published on: August 16, 2024

A Tactile Automated Passive-Finger Stimulator (TAPS)

A Tactile Automated Passive-Finger Stimulator (TAPS)

Published on: June 3, 2009

Area of Science:

Psychometrics
Educational Measurement
Computerized Adaptive Testing (CAT)

Background:

Computerized Adaptive Testing (CAT) utilizes item response theory (IRT) models for efficient ability estimation.
Item selection algorithms in CAT can be susceptible to 'capitalization on chance,' where item parameters are estimated with bias due to sampling variability.
The 3-parameter logistic (3PL) model is commonly employed in CAT, but its accuracy depends on robust item parameter estimation.

Purpose of the Study:

To investigate the effects of capitalization on chance in item selection and ability estimation within CAT.
To examine how calibration sample size and item bank to test length ratio influence estimation errors in CAT.
To evaluate the performance of CAT compared to random testing under varying conditions.

Main Methods:

Simulation studies were conducted using the 3-parameter logistic (3PL) model.
Manipulation of calibration sample sizes (N = 500, 1000, 2000) and item bank size to test length ratios (197/20, 197/40, 788/20, 788/40).
Comparison of item selection and ability estimation in CAT versus random test administration.

Main Results:

Capitalization on chance was found to be a significant issue in CAT, especially under small calibration sample conditions, leading to large positive bias.
Overestimation of precision (asymptotic Standard Error) reached up to 40% for broad ranges of theta in CAT, unlike Root Mean Square Error (RMSE).
The problem of capitalization on chance intensified with increasing item bank size to test length ratios.

Conclusions:

Capitalization on chance poses a serious threat to the accuracy of ability estimation in CAT, particularly with limited calibration data.
The choice of item selection algorithm and exposure control methods are critical for mitigating bias in CAT.
Further research into effective exposure control strategies is warranted to improve the reliability of CAT systems.