Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Test for Homogeneity01:23

Test for Homogeneity

1.7K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
1.7K
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

588
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
588
Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test01:09

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

6.2K
In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with...
6.2K
Comparing Experimental Results: Student's t-Test01:09

Comparing Experimental Results: Student's t-Test

5.8K
The t-test is a statistical method used to compare the sample mean with a population mean or compare two means from two data sets. The test statistic is calculated from the standard deviation, mean, and number of measurements in the data set at a selected confidence interval and then compared to a table of critical values at this confidence level. If the test statistic is smaller than the critical value, the null hypothesis is accepted. In this case, we state that the difference between the...
5.8K
Significance Testing: Overview01:04

Significance Testing: Overview

10.1K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
10.1K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

7.1K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
7.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Tracking a multitude of abilities as they develop.

The British journal of mathematical and statistical psychology·2022
Same author

Measurement of Ability in Adaptive Learning and Assessment Systems when Learners Use On-Demand Hints.

Applied psychological measurement·2022
Same author

An Attention-Based Diffusion Model for Psychometric Analyses.

Psychometrika·2021
Same author

A Rasch Model and Rating System for Continuous Responses Collected in Large-Scale Learning Systems.

Frontiers in psychology·2021
Same author

Deviations of rational choice: an integrative explanation of the endowment and several context effects.

Scientific reports·2020
Same author

Tracking with (Un)Certainty.

Journal of Intelligence·2020
Same journal

Testing linear hypotheses in repeated measures generalized linear models using external information.

Psychometrika·2026
Same journal

When Do Unifactorial Items Increase the Reliability?

Psychometrika·2026
Same journal

Longitudinal Designs for Diagnostic Models: Identification and Estimation.

Psychometrika·2026
Same journal

Modeling Rare Events and Nonmonotone Nonignorable Missingness of Time-Varying Outcomes and Predictors in Binary Time-Series Daily Diary Data: A Bayesian Selection Model.

Psychometrika·2026
Same journal

Revelle's Beta: The Wait Is Over-Computation Becomes Possible.

Psychometrika·2026
Same journal

On dimensional implication graphs.

Psychometrika·2026
See all related articles

Related Experiment Video

Updated: Apr 23, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

1.3K

A Statistical Test for Differential Item Pair Functioning.

Timo M Bechger1, Gunter Maris

  • 1Cito, Amsterdamseweg 13, Arnhem, The Netherlands, timo.bechger@cito.nl.

Psychometrika
|September 17, 2014
PubMed
Summary
This summary is machine-generated.

This study introduces a new statistical test for differential item functioning (DIF) using item response theory (IRT). It defines DIF based on relative item difficulties, offering a novel approach for psychometric analysis.

More Related Videos

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

5.4K
Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits
08:27

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

6.1K

Related Experiment Videos

Last Updated: Apr 23, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

1.3K
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

5.4K
Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits
08:27

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

6.1K

Area of Science:

  • Psychometrics
  • Statistical modeling

Background:

  • Differential item functioning (DIF) is crucial for detecting item bias in assessments.
  • Existing DIF detection methods often rely on individual item parameters, which can be difficult to estimate accurately.

Purpose of the Study:

  • To develop a novel IRT-based statistical test for DIF.
  • To define DIF in terms of relative item difficulties rather than absolute difficulties.

Main Methods:

  • The proposed test is based on the Rasch model, with extensions outlined for more complex IRT models.
  • It focuses on the relative difficulties of item pairs, which are identifiable from observed data.
  • The method is related to Lord's DIF test but offers a refined interpretation.

Main Results:

  • The new test provides a statistically sound method for identifying DIF.
  • Illustrations using both real and simulated data demonstrate the test's applicability and effectiveness.

Conclusions:

  • The proposed IRT-based DIF test offers a more robust approach by utilizing relative item difficulties.
  • This method enhances the accuracy and interpretability of DIF detection in psychometric research.