Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Response Surface Methodology01:16

Response Surface Methodology

91
Response Surface Methodology (RSM) is a collection of statistical and mathematical techniques used to develop, improve, and optimize processes. It is particularly valuable when many input variables or factors potentially influence a response variable.
The process of RSM involves several key steps:
91
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

146
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
146
Factorial Design02:01

Factorial Design

13.0K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
13.0K
Interpreting R Charts01:22

Interpreting R Charts

51
R chart, or range chart, is a fundamental tool in statistical process control used to monitor the variability within a process. It complements the X-bar (x̄) chart by focusing on the range of the data, rather than individual values, providing a clear picture of the process dispersion over time.
An R chart plots the range of subsets of measurements collected from a process. Each point on the chart represents the range—defined as the difference between the maximum and minimum...
51
Two-Way ANOVA01:17

Two-Way ANOVA

2.6K
The two-way ANOVA is an extension of the one-way ANOVA. It is a statistical test performed on three or more samples categorized by two factors - a row factor and a column factor. Ronald Fischer mentioned it in 1925 in his book 'Statistical Methods for Researchers.'
The two-way ANOVA analysis initially begins by stating the null hypothesis that there is an interaction effect between the two factors of a dataset. This effect can be visualized using line segments formed by joining the...
2.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Enhanced invasiveness promotes the dominance of a widely-distributed carbapenem-resistant virulence-plasmid-carrying Klebsiella pneumoniae sublineage.

Nature communications·2026
Same author

Evaluating Neural Networks Architectures for Competency Prediction from Process Data Using PISA Computer-Based Mathematics Assessment.

Journal of Intelligence·2026
Same author

Genome sequence of lytic phage phi1_164023 targeting ST23 KL1-type carbapenem-resistant <i>Klebsiella pneumoniae</i>.

Microbiology resource announcements·2026
Same author

Medical SAM-Clip Grafting for brain tumor segmentation.

Computers in biology and medicine·2025
Same author

Cognitive apprenticeship in the anatomical sciences: A study of the relationship between the anatomical expertise and clinical expertise of medical students as demonstrated on standardized assessments.

Anatomical sciences education·2025
Same author

Investigating the effect of transformer encoder architecture to improve the reliability of classroom observation ratings on high-inference discourse.

Behavior research methods·2025
Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026
Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026
Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026
Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026
Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026
Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026
See all related articles

Related Experiment Video

Updated: Jun 6, 2025

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

5.8K

Exploring the Evidence to Interpret Differential Item Functioning via Response Process Data.

Ziying Li1, Jinnie Shin1, Huan Kuang2

  • 1University of Florida, Gainesville, USA.

Educational and Psychological Measurement
|December 2, 2024
PubMed
Summary
This summary is machine-generated.

Response process data, including timing and action sequences, significantly enhances the interpretation of differential item functioning (DIF) in assessments. This approach improves measurement fairness by revealing group differences in how test-takers respond.

Keywords:
DIFMantel–Haenszelrandom forestresponse process dataridge logistic regression

More Related Videos

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

692
Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities
10:26

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

3.9K

Related Experiment Videos

Last Updated: Jun 6, 2025

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

5.8K
Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

692
Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities
10:26

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

3.9K

Area of Science:

  • Educational Measurement
  • Psychometrics
  • Data Science

Background:

  • Differential item functioning (DIF) evaluation is crucial for ensuring assessment fairness across diverse subgroups.
  • Traditional DIF methods relying solely on item response scores present interpretability challenges.
  • Response process data offer novel insights into examinee behaviors, aiding DIF interpretation.

Purpose of the Study:

  • To explore the utility of response process data features for enhancing the interpretability of DIF items.
  • To focus on gender-based DIF within the Programme for International Assessment of Adult Competencies (PIAAC) 2012 numeracy assessment.
  • To identify key process data features that explain DIF.

Main Methods:

  • Utilized random forest and logistic regression with ridge regularization.
  • Investigated associations between process data features and DIF items.
  • Assessed model performance across varying percentages of DIF items to simulate real-world conditions.

Main Results:

  • Timing and action-sequence features were found to be highly informative.
  • These features effectively revealed response process differences between gender groups.
  • The combination of process data features significantly enhanced DIF item interpretability.

Conclusions:

  • Response process data provide a feasible method for understanding and interpreting DIF items.
  • This approach can illuminate reasons for discrepancies between DIF statistics and expert reviews.
  • Leveraging process data can help identify and mitigate irrelevant factors affecting measurement equity.