Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Comparison Tests01:13

Multiple Comparison Tests

4.1K
Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
4.1K
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

351
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
351
Bonferroni Test01:10

Bonferroni Test

3.0K
The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...
3.0K
Wilcoxon Signed-Ranks Test for Matched Pairs01:09

Wilcoxon Signed-Ranks Test for Matched Pairs

278
The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...
278
Self-Report Tests of Personality01:22

Self-Report Tests of Personality

538
Self-report inventories are objective personality assessments that use multiple-choice items or numbered scales, typically ranging from 1 (strongly disagree) to 5 (strongly agree). They are often called Likert scales after Rensis Likert. These inventories are widely used due to their ease of administration and cost-effectiveness. One of the most prominent examples is the Minnesota Multiphasic Personality Inventory (MMPI), initially developed in the 1940s to assess abnormal personality traits.
538
Kendall's Tau Test01:16

Kendall's Tau Test

916
Kendall's tau test, also known as the Kendall rank coefficient test, is a nonparametric method for assessing association between two variables. This test is particularly useful for identifying significant correlations when the distributions of the sample and population are unknown. Developed in 1938 by the British statistician Sir Maurice George Kendall, the tau coefficient (denoted as τ) serves as a rank correlation coefficient, with values ranging from -1 to +1.
A τ value of +1 indicates...
916

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Computerized Adaptive Testing for Ipsative Tests with Multidimensional Pairwise-Comparison Items: Algorithm Development and Applications.

Applied psychological measurement·2022
Same author

An Empirical Q-Matrix Validation Method for the Polytomous G-DINA Model.

Psychometrika·2021
Same author

A new item response theory model for rater centrality using a hierarchical rater model approach.

Behavior research methods·2021
Same author

Assessment of Differential Statement Functioning in Ipsative Tests With Multidimensional Forced-Choice Items.

Applied psychological measurement·2021
Same author

Measuring dynamic goals for marriage: Development and validation of the Marital Goal Scale using Rasch modeling.

Psychological assessment·2019
Same author

Item Response Theory Modeling for Examinee-selected Items with Rater Effect.

Applied psychological measurement·2019
Same journal

Development of a Short Form of the CPAI-A (Form B) with Rasch Analyses.

Journal of applied measurement·2021
Same journal

Evaluating the Impact of Multidimensionality on Type I and Type II Error Rates using the Q-Index Item Fit Statistic for the Rasch Model.

Journal of applied measurement·2021
Same journal

Diabetes Distress in Emerging Adults: Refining the Problem Areas in Diabetes-Emerging Adult Version using Rasch Analysis.

Journal of applied measurement·2021
Same journal

A Psychometric Replication of Fan (1998) Item Response Theory and Classical Test Theory: An Empirical Comparison of their Item/Person Statistics.

Journal of applied measurement·2021
Same journal

The Development of the Mental Toughness Situational Judgment Test: A Novel Approach to Assessing Mental Toughness.

Journal of applied measurement·2021
Same journal

Using the Rasch Model to Measure Comprehension of Fraction Addition.

Journal of applied measurement·2021
See all related articles

Related Experiment Video

Updated: Nov 5, 2025

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.0K

Assessing Differential Statement Functioning in Polytomous Multidimensional Pairwise Comparison Items.

Xue-Lan Qiu1

  • 1Xue-Lan Qiu, Faculty of Education, The University of Hong Kong, Pokfulam, Hong Kong, xlqiu@hku.edu.

Journal of Applied Measurement
|May 13, 2021
PubMed
Summary
This summary is machine-generated.

Differential statement functioning (DSF) in multidimensional pairwise comparison (MPC) items can be accurately detected using the constant-statement (CS) method. This approach, based on a Rasch model, offers reliable assessment for career interest and personality evaluations.

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.7K
A Two-interval Forced-choice Task for Multisensory Comparisons
07:13

A Two-interval Forced-choice Task for Multisensory Comparisons

Published on: November 9, 2018

11.2K

Related Experiment Videos

Last Updated: Nov 5, 2025

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.0K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.7K
A Two-interval Forced-choice Task for Multisensory Comparisons
07:13

A Two-interval Forced-choice Task for Multisensory Comparisons

Published on: November 9, 2018

11.2K

Area of Science:

  • Educational Psychology
  • Psychometrics

Background:

  • Multidimensional pairwise comparison (MPC) items are prevalent in educational settings for assessing career interest, values, and personality, aiming to mitigate response biases.
  • Differential statement functioning (DSF) occurs when an MPC item statement exhibits varying utility across different demographic groups, a phenomenon under-researched.
  • Existing methods for detecting DSF in MPC items are limited, necessitating the development of robust analytical approaches.

Purpose of the Study:

  • To adapt and evaluate three methods—equal-mean-utility (EMU), all-other-statement (AOS), and constant-statement (CS)—for detecting differential statement functioning (DSF) in polytomous multidimensional pairwise comparison (MPC) items.
  • To assess the performance of these methods using a simulation study, focusing on parameter recovery and DSF detection accuracy.
  • To provide an empirical application of the proposed methods in the context of career interest assessment.

Main Methods:

  • The study adapted three DSF detection methods (EMU, AOS, CS) for polytomous MPC items, grounded in a Rasch model framework.
  • A simulation study was employed to evaluate the accuracy of parameter recovery and the effectiveness of the proposed DSF detection methods under various conditions.
  • The constant-statement (CS) method was specifically investigated, utilizing DSF-free statements as anchors.

Main Results:

  • Simulation results indicated that the constant-statement (CS) method demonstrates superior performance in accurately estimating parameters and detecting differential statement functioning (DSF) when DSF statements are present.
  • The CS method, particularly when employing one or more DSF-free statements as anchors, yielded the most accurate results in the simulation study.
  • The study successfully applied the developed methods to an empirical dataset concerning career interest assessment, validating their practical utility.

Conclusions:

  • The constant-statement (CS) method is recommended for detecting differential statement functioning (DSF) in polytomous multidimensional pairwise comparison (MPC) items due to its accuracy and robust performance.
  • The findings underscore the importance of assessing DSF in MPC items to ensure measurement invariance across different groups in educational and psychological assessments.
  • The adapted Rasch model-based methods, especially the CS approach, provide valuable tools for researchers and practitioners evaluating career interest, personality, and values.