Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.4K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

782
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
782
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

319
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
319
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

2.9K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
2.9K
Sieve Analysis and Grading Curves01:19

Sieve Analysis and Grading Curves

571
Sieve analysis is a method used to determine the particle size distribution of aggregate materials. This process involves the following steps:
571
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

4.4K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
4.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Crisis-Optimized Patient Tracking: A Novel Approach for Mass Casualty Incident Management in Emergency Departments.

Disaster medicine and public health preparedness·2026
Same author

Improving Latent Trait Estimation in Multidimensional Forced Choice Measures: Latent Regression Multi-Unidimensional Pairwise Preference Model.

Applied psychological measurement·2026
Same author

Time versus nature: Longitudinal effects of job stressors on work outcomes.

Journal of occupational health psychology·2025
Same author

Evidence That the DSM-5-TR Eating Disorder Criteria Perform Differently Across Cisgender and Transgender or Gender Diverse Individuals Using the Eating Disorder Diagnostic Scale.

The International journal of eating disorders·2025
Same author

<i>fcirt</i>: An R Package for Forced Choice Models in Item Response Theory.

Applied psychological measurement·2025
Same author

Detecting DIF with the Multi-Unidimensional Pairwise Preference Model: Lord's Chi-square and IPR-NCDIF Methods.

Applied psychological measurement·2025
Same journal

babebi: An R Package for Bayesian Estimation and Validation in Small-N Two-Rater Pre-Post Designs.

Applied psychological measurement·2026
Same journal

A Tool for Agreement and Alignment Analysis in Binary Rating Tasks: The R Package scindex.

Applied psychological measurement·2026
Same journal

The EM Algorithm and Its Variants in Cognitive Diagnostic Models: Comparing Their Propensity for Boundaries, Extremes, Convergence, and Suboptimal Solutions.

Applied psychological measurement·2026
Same journal

When Perceptions of Social Desirability Differ: Implications for the Multidimensional Nominal Response Model of Faking.

Applied psychological measurement·2026
Same journal

csemGT: An R Package for Estimating Raw-Score Conditional Standard Errors of Measurement in Generalizability Theory.

Applied psychological measurement·2026
Same journal

Confirmatory Factor Analysis with Adaptive Quadrature Estimator Using Four Link Functions.

Applied psychological measurement·2026
See all related articles

Related Experiment Video

Updated: Sep 30, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

931

Bayesian Approaches for Detecting Differential Item Functioning Using the Generalized Graded Unfolding Model.

Seang-Hwane Joo1, Philseok Lee2, Stephen Stark3

  • 1The University of Kansas, Lawrence, KS, USA.

Applied Psychological Measurement
|March 14, 2022
PubMed
Summary
This summary is machine-generated.

Two Bayesian methods, Bayes factor (BF) and deviance information criterion (DIC), show excellent performance for differential item functioning (DIF) analysis in psychological assessment, outperforming traditional methods when group distributions differ.

Keywords:
Bayes factordeviance information criteriondifferential item functioningideal pointitem response theory

More Related Videos

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing
15:00

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

790
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.0K

Related Experiment Videos

Last Updated: Sep 30, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

931
A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing
15:00

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

790
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.0K

Area of Science:

  • Psychological Assessment
  • Psychometrics
  • Statistical Modeling

Background:

  • Differential item functioning (DIF) analysis is crucial in psychological assessment for ensuring fairness.
  • Item response theory (IRT) provides a framework for DIF analysis.
  • Bayesian methods offer alternative approaches to traditional likelihood-based methods.

Purpose of the Study:

  • To evaluate the performance of two Bayesian DIF methods (Bayes factor and deviance information criterion) within the generalized graded unfolding model (GGUM).
  • To compare the effectiveness of Bayesian methods against likelihood-based methods (likelihood ratio test and Akaike information criterion) in detecting DIF.
  • To investigate the impact of various factors (sample size, DIF characteristics, subgroup distributions) on DIF detection accuracy.

Main Methods:

  • A Monte Carlo simulation study was conducted.
  • The generalized graded unfolding model (GGUM) was used.
  • Bayesian (Bayes factor, DIC) and likelihood-based (LR, AIC) DIF detection methods were implemented and compared.

Main Results:

  • Bayesian methods (BF and DIC) demonstrated well-controlled Type I error rates and high statistical power.
  • The Bayesian methods outperformed likelihood-based methods (LR and AIC) in controlling Type I error rates, especially when subgroup trait distributions differed.
  • Performance was evaluated under various simulation conditions including sample size, DIF source, size, location, and baseline model type.

Conclusions:

  • Bayesian DIF analysis using BF and DIC with GGUM offers a robust and effective approach for psychological assessment.
  • These Bayesian methods provide superior control over Type I errors compared to traditional likelihood-based methods under specific conditions.
  • Recommendations are provided for the application of these advanced DIF detection techniques in applied research.