Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Friedman Two-way Analysis of Variance by Ranks

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

Sieve Analysis and Grading Curves

Sieve Analysis and Grading Curves

Sieve analysis is a method used to determine the particle size distribution of aggregate materials. This process involves the following steps:

Goodness-of-Fit Test

Goodness-of-Fit Test

The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Crisis-Optimized Patient Tracking: A Novel Approach for Mass Casualty Incident Management in Emergency Departments.

Disaster medicine and public health preparedness·2026

Same author

Improving Latent Trait Estimation in Multidimensional Forced Choice Measures: Latent Regression Multi-Unidimensional Pairwise Preference Model.

Applied psychological measurement·2026

Same author

Time versus nature: Longitudinal effects of job stressors on work outcomes.

Journal of occupational health psychology·2025

Same author

Evidence That the DSM-5-TR Eating Disorder Criteria Perform Differently Across Cisgender and Transgender or Gender Diverse Individuals Using the Eating Disorder Diagnostic Scale.

The International journal of eating disorders·2025

Same author

<i>fcirt</i>: An R Package for Forced Choice Models in Item Response Theory.

Applied psychological measurement·2025

Same author

Detecting DIF with the Multi-Unidimensional Pairwise Preference Model: Lord's Chi-square and IPR-NCDIF Methods.

Applied psychological measurement·2025

Same journal

babebi: An R Package for Bayesian Estimation and Validation in Small-N Two-Rater Pre-Post Designs.

Applied psychological measurement·2026

Same journal

A Tool for Agreement and Alignment Analysis in Binary Rating Tasks: The R Package scindex.

Applied psychological measurement·2026

Same journal

The EM Algorithm and Its Variants in Cognitive Diagnostic Models: Comparing Their Propensity for Boundaries, Extremes, Convergence, and Suboptimal Solutions.

Applied psychological measurement·2026

Same journal

When Perceptions of Social Desirability Differ: Implications for the Multidimensional Nominal Response Model of Faking.

Applied psychological measurement·2026

Same journal

csemGT: An R Package for Estimating Raw-Score Conditional Standard Errors of Measurement in Generalizability Theory.

Applied psychological measurement·2026

Same journal

Confirmatory Factor Analysis with Adaptive Quadrature Estimator Using Four Link Functions.

Applied psychological measurement·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 30, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Bayesian Approaches for Detecting Differential Item Functioning Using the Generalized Graded Unfolding Model.

Seang-Hwane Joo¹, Philseok Lee², Stephen Stark³

¹The University of Kansas, Lawrence, KS, USA.

Applied Psychological Measurement

|March 14, 2022

Summary

This summary is machine-generated.

Two Bayesian methods, Bayes factor (BF) and deviance information criterion (DIC), show excellent performance for differential item functioning (DIF) analysis in psychological assessment, outperforming traditional methods when group distributions differ.

Keywords:

Bayes factor deviance information criterion differential item functioning ideal point item response theory

More Related Videos

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Computerized Adaptive Testing System of Functional Assessment of Stroke

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Related Experiment Videos

Last Updated: Sep 30, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Computerized Adaptive Testing System of Functional Assessment of Stroke

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Area of Science:

Psychological Assessment
Psychometrics
Statistical Modeling

Background:

Differential item functioning (DIF) analysis is crucial in psychological assessment for ensuring fairness.
Item response theory (IRT) provides a framework for DIF analysis.
Bayesian methods offer alternative approaches to traditional likelihood-based methods.

Purpose of the Study:

To evaluate the performance of two Bayesian DIF methods (Bayes factor and deviance information criterion) within the generalized graded unfolding model (GGUM).
To compare the effectiveness of Bayesian methods against likelihood-based methods (likelihood ratio test and Akaike information criterion) in detecting DIF.
To investigate the impact of various factors (sample size, DIF characteristics, subgroup distributions) on DIF detection accuracy.

Main Methods:

A Monte Carlo simulation study was conducted.
The generalized graded unfolding model (GGUM) was used.
Bayesian (Bayes factor, DIC) and likelihood-based (LR, AIC) DIF detection methods were implemented and compared.

Main Results:

Bayesian methods (BF and DIC) demonstrated well-controlled Type I error rates and high statistical power.
The Bayesian methods outperformed likelihood-based methods (LR and AIC) in controlling Type I error rates, especially when subgroup trait distributions differed.
Performance was evaluated under various simulation conditions including sample size, DIF source, size, location, and baseline model type.

Conclusions:

Bayesian DIF analysis using BF and DIC with GGUM offers a robust and effective approach for psychological assessment.
These Bayesian methods provide superior control over Type I errors compared to traditional likelihood-based methods under specific conditions.
Recommendations are provided for the application of these advanced DIF detection techniques in applied research.