Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Response Surface Methodology01:16

Response Surface Methodology

127
Response Surface Methodology (RSM) is a collection of statistical and mathematical techniques used to develop, improve, and optimize processes. It is particularly valuable when many input variables or factors potentially influence a response variable.
The process of RSM involves several key steps:
127
Modeling in Therapy01:26

Modeling in Therapy

71
Modeling, a key technique in therapy, uses observational learning to help clients acquire and practice new skills by watching therapists demonstrate desired behaviors. This approach, rooted in Albert Bandura's concept of vicarious learning, plays a significant role in therapeutic interventions for various psychological conditions, including social anxiety, ADHD, and depression.
Participant Modeling
Participant modeling involves therapists demonstrating calm and effective behaviors in...
71
Reliability and Validity01:29

Reliability and Validity

12.7K
Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
12.7K
Dose-Response Relationship: Overview01:03

Dose-Response Relationship: Overview

3.1K
Agonists can bind with and activate receptors, resulting in the formation of drug-receptor complexes. Once formed, these complexes catalyze many biochemical processes at the cellular level and subsequently induce a pharmacologic response. The degree of response is directly proportional to the fraction of activated receptors, which in turn, depends on the concentration of the drug at the receptor site as well as the sensitivity of the receptor. An increase in the administered dose contributes to...
3.1K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

3.3K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
3.3K
Self-Evaluation: Self-Enhancement and Self-Verification03:00

Self-Evaluation: Self-Enhancement and Self-Verification

5.2K
Social psychologists have documented that feeling good about ourselves and maintaining positive self-esteem is a powerful motivator of human behavior (Tavris & Aronson, 2008). In the United States, members of the predominant culture typically think very highly of themselves and view themselves as good people who are above average on many desirable traits (Ehrlinger, Gilovich, & Ross, 2005). Often, our behavior, attitudes, and beliefs are affected when we experience a threat to our...
5.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Use of Crossed Random-Effects Models to Assess Multiple-Choice Items: An Experimental Study.

The Spanish journal of psychology·2026
Same author

Evaluating the Performance of R-Squared Measures in Multilevel Models.

Multivariate behavioral research·2026
Same author

A qualitative exploration of video-based motor action observation perceptions in patients with chronic low back pain and asymptomatic participants: An interpretative phenomenological analysis.

PloS one·2026
Same author

Dimensionality Assessment in Forced-Choice Questionnaires: First Steps Toward an Exploratory Framework.

Educational and psychological measurement·2025
Same author

A general diagnostic modelling framework for forced-choice assessments.

The British journal of mathematical and statistical psychology·2025
Same author

Cross-validation and predictive metrics in psychological research: Do not leave out the leave-one-out.

Behavior research methods·2025
Same journal

Personal Recovery in Addictions: Development of a new Assessment Instrument.

Psicothema·2026
Same journal

Internet Habits, Problematic Internet Use, and Online Risk Practices Among Adolescents With ADHD in Spain.

Psicothema·2026
Same journal

Relationship Between Social Connectedness and Quality of Life in Older Adults: An Examination of Sex Differences.

Psicothema·2026
Same journal

The Influence of Gender on Measuring Mental Health Stigma. A Cross-Sectional Vignette Study With the Attribution Questionnaire 9.

Psicothema·2026
Same journal

Comprehensive Assessment of Mental Health Stigma.

Psicothema·2026
Same journal

Psychometric Properties of the Teachers' Responses to Bullying Questionnaire (TRBQ) in Spanish Students.

Psicothema·2026
See all related articles

Related Experiment Video

Updated: Jun 27, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

756

Enhancing Content Validity Assessment With Item Response Theory Modeling.

Rodrigo Schames Kreitchmann1, Pablo Nájera2, Susana Sanz2

  • 1Universidad Nacional de Educación a Distancia (Spain).

Psicothema
|April 25, 2024
PubMed
Summary
This summary is machine-generated.

Item response theory (IRT) enhances subject matter expert (SME) assessments by improving item relevance and predicting factor loadings. This method offers superior validity evidence compared to traditional indices.

More Related Videos

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.2K
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

5.8K

Related Experiment Videos

Last Updated: Jun 27, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

756
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.2K
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

5.8K

Area of Science:

  • Psychometrics
  • Educational Measurement
  • Assessment Validity

Background:

  • Ensuring assessment validity relies on rigorous test content evaluation.
  • Subject matter experts (SMEs) commonly assess item relevance, representativeness, and appropriateness.
  • This study introduces item response theory (IRT) for SME model assessments.

Purpose of the Study:

  • To integrate IRT into SME assessments for evaluating item relevance and SME performance.
  • To compare IRT-based validity with traditional indices like content validity index and Aiken's V.
  • To assess SME accuracy in identifying trait-specific items and predicting factor loadings.

Main Methods:

  • Item response theory (IRT) was applied to model SME evaluations.
  • IRT-derived SME parameters (discrimination, threshold) were estimated.
  • IRT scores were compared against content validity index and Aiken's V for predictive accuracy.

Main Results:

  • IRT scores accurately identified conscientiousness items (R2 = 0.57) and predicted factor loadings (R2 = 0.45).
  • IRT demonstrated incremental validity, explaining 11-17% more variance than traditional indices.
  • IRT effectively detected suboptimal SME performance in item evaluation.

Conclusions:

  • Modeling SME assessments with IRT enhances item alignment with intended constructs.
  • IRT provides improved prediction of factor loadings, strengthening content validity.
  • This approach facilitates the development of more valid measurement instruments.