Using a Generalized Logistic Regression Method to Detect Differential Item Functioning With Multiple Groups in

Xiaojian Sun¹,², Shimeng Wang³, Lei Guo⁴

¹School of Mathematics and Statistics, Southwest University, Chongqing, China.

Applied Psychological Measurement

June 7, 2023

Summary

This summary is machine-generated.

Differential item functioning (DIF) compromises test validity. This study introduces generalized logistic regression (GLR) methods to detect DIF in multiple groups within cognitive diagnostic assessment, outperforming traditional methods.

Keywords:

cognitive diagnostic assessment; differential item functioning; generalized logistic regression; multiple groups

Area of Science:

Psychometrics
Educational Measurement
Cognitive Diagnostic Assessment (CDA)

Background:

Differential item functioning (DIF) can undermine the validity and fairness of assessments.
Existing DIF detection methods primarily focus on two-group comparisons, limiting their application in complex, multi-group scenarios.
Detecting DIF in cognitive diagnostic assessment (CDA) across multiple groups remains an under-researched area.

Purpose of the Study:

To investigate the efficacy of generalized logistic regression (GLR) methods for detecting DIF in multi-group CDA settings.
To compare the performance of GLR-based Wald test (GLR-Wald) and GLR-based likelihood ratio test (GLR-LRT) against the ordinary Wald test.
To evaluate the impact of using estimated attribute profiles as matching criteria in GLR-based DIF detection.

Main Methods:

Employed generalized logistic regression (GLR) to detect DIF items.
Utilized estimated attribute profiles as matching criteria within the GLR framework.
Conducted a simulation study comparing GLR-Wald, GLR-LRT, and the ordinary Wald test, supplemented by a real data analysis.
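
As a rough illustration of the likelihood ratio idea behind GLR-LRT, the sketch below fits two nested logistic regressions for a single item: one with the matching variable only, and one that adds a dummy variable for each non-reference group. It is a minimal sketch under stated assumptions, not the authors' specification: the matching variable is a hypothetical count of mastered attributes, only uniform DIF is tested (no interaction terms), the data are simulated, and the names solve, fit_logit, and lrt_uniform_dif are invented for this example.

```python
import math
import random


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_logit(X, y, iters=25):
    """Newton-Raphson logistic regression; returns (coefficients, log-likelihood)."""
    p = len(X[0])
    beta = [0.0] * p
    for _ in range(iters):
        grad = [0.0] * p
        hess = [[0.0] * p for _ in range(p)]
        for xi, yi in zip(X, y):
            eta = sum(b * v for b, v in zip(beta, xi))
            mu = 1.0 / (1.0 + math.exp(-eta))
            w = mu * (1.0 - mu)
            for j in range(p):
                grad[j] += (yi - mu) * xi[j]
                for k in range(p):
                    hess[j][k] += w * xi[j] * xi[k]
        beta = [b + s for b, s in zip(beta, solve(hess, grad))]
    ll = 0.0
    for xi, yi in zip(X, y):
        eta = sum(b * v for b, v in zip(beta, xi))
        ll += yi * eta - math.log(1.0 + math.exp(eta))
    return beta, ll


def lrt_uniform_dif(y, match, group, n_groups):
    """LRT for uniform DIF: do group dummies improve fit beyond the matching variable?"""
    reduced = [[1.0, m] for m in match]
    full = [[1.0, m] + [1.0 if g == k else 0.0 for k in range(1, n_groups)]
            for m, g in zip(match, group)]
    _, ll_reduced = fit_logit(reduced, y)
    _, ll_full = fit_logit(full, y)
    return 2.0 * (ll_full - ll_reduced), n_groups - 1


# Simulated responses to one item for three groups (hypothetical values).
random.seed(0)
y, match, group = [], [], []
for i in range(1500):
    g = i % 3                      # group 0 is the reference group
    m = random.randint(0, 3)       # assumed matching score: mastered attributes
    eta = -1.5 + 1.0 * m - (1.0 if g == 2 else 0.0)  # group 2 is disadvantaged
    prob = 1.0 / (1.0 + math.exp(-eta))
    y.append(1 if random.random() < prob else 0)
    match.append(float(m))
    group.append(g)

stat, df = lrt_uniform_dif(y, match, group, n_groups=3)
CRIT_DF2 = 5.991                   # chi-square 0.95 quantile for df = 2
flagged = stat > CRIT_DF2
print(f"LRT statistic = {stat:.2f}, df = {df}, flagged = {flagged}")
```

The statistic is referred to a chi-square distribution with one degree of freedom per added group term, which is what lets a single test cover more than two groups. A Wald-type variant would instead test the group coefficients of the full model against zero using their estimated covariance matrix.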

Main Results:

Both GLR-Wald and GLR-LRT demonstrated superior control of Type I error rates compared to the ordinary Wald test across most simulated conditions.
The GLR methods yielded higher empirical rejection rates, indicating greater power in detecting DIF items than the ordinary Wald test.
Employing estimated attribute profiles as matching criteria resulted in comparable Type I error rates and empirical rejection rates for GLR-Wald and GLR-LRT.
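
The two evaluation criteria named above can be computed from simulation output in a few lines. The sketch below assumes each replication yields one flag per item and that the set of truly DIF items is known from the simulation design; the function name rejection_rates and the toy flags are hypothetical and are not taken from the study.

```python
def rejection_rates(flags, dif_items):
    """Return (Type I error rate, empirical rejection rate for DIF items).

    flags: one list per replication, holding a boolean per item (True = flagged).
    dif_items: set of item indices simulated with DIF.
    """
    false_pos = null_total = true_pos = dif_total = 0
    for replication in flags:
        for item, flagged in enumerate(replication):
            if item in dif_items:
                dif_total += 1
                true_pos += int(flagged)
            else:
                null_total += 1
                false_pos += int(flagged)
    type1 = false_pos / null_total if null_total else float("nan")
    power = true_pos / dif_total if dif_total else float("nan")
    return type1, power


# Toy example: 2 replications, 4 items, items 2 and 3 simulated with DIF.
toy_flags = [
    [False, True, True, True],
    [False, False, True, False],
]
type1, power = rejection_rates(toy_flags, dif_items={2, 3})
print(type1, power)  # 0.25 0.75
```

A Type I error rate near the nominal level (for example 0.05) on the DIF-free items, together with a high rejection rate on the DIF items, is the pattern that would count as good performance for a detection method.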

Conclusions:

The GLR method, particularly with estimated attribute profiles, offers a robust approach for detecting DIF in multi-group CDA.
GLR-based tests (Wald and LRT) provide improved accuracy and power for DIF detection compared to the ordinary Wald test in multi-group contexts.
These findings support the application of GLR methods for enhancing the validity and fairness of cognitive diagnostic assessments across diverse groups.