Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Differential Leveling

Differential Leveling

Differential leveling is a precise method in surveying used to determine the elevation difference between two points. Its primary goal is to establish accurate vertical measurements to create level surfaces or grade lines critical for designing and constructing infrastructures such as roads, bridges, and buildings.The procedure for differential leveling begins with setting up and leveling the instrument at a point where the benchmark can be seen. The level rod is held on the benchmark (BM), and...

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...

One-Way ANOVA: Unequal Sample Sizes

One-Way ANOVA: Unequal Sample Sizes

One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:

Ratio Level of Measurement

Ratio Level of Measurement

The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
A set of data measured using the ratio scale takes care of the ratio problem and provides complete information. Ratio scale data are like interval scale data, except they have a zero point and ratios can be calculated....

Data Validation

Data Validation

Method validation is a crucial process in analytical chemistry designed to confirm that a given method consistently produces reliable and high-quality results. This process is essential when a method is applied to different sample matrices or when procedural modifications are made, ensuring that the results meet acceptable standards across various applications.
Key parameters for method validation include:

Multiple Comparison Tests

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Association of ACGME Milestones With Other Performance Measures in General Surgery: A Meta-Analytic Study.

Academic medicine : journal of the Association of American Medical Colleges·2025

Same author

Construct Validity Evidence for ACGME Milestones in Surgical Specialties: A Systematic Review.

Journal of graduate medical education·2025

Same author

Longitudinal Reliability of Milestones Learning Trajectories during Anesthesiology Residency.

Anesthesiology·2025

Same author

Self-Regulated Learning and Learning Outcomes in Undergraduate and Graduate Medical Education: A Meta-Analysis.

Evaluation & the health professions·2024

Same author

Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy.

Educational and psychological measurement·2023

Same author

Extended Multivariate Generalizability Theory With Complex Design Structures.

Educational and psychological measurement·2022

Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026

Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026

Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026

Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026

Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026

Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 26, 2025

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Evaluating Equating Methods for Varying Levels of Form Difference.

Ting Sun¹, Stella Yun Kim²

¹University of Utah, Salt Lake City, USA.

Educational and Psychological Measurement

|May 17, 2024

Summary

This summary is machine-generated.

Choosing the right statistical equating method depends on how different test forms are in difficulty. This study guides practitioners on selecting appropriate equating techniques for accurate score interpretation.

Keywords:

equating form difficulty

More Related Videos

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Related Experiment Videos

Last Updated: Jun 26, 2025

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Area of Science:

Psychometrics
Educational Measurement
Statistical Modeling

Background:

Equating adjusts for test form difficulty differences to ensure comparable scores.
Current equating practices often overlook the magnitude of form difficulty variations.
Lack of guidance exists for selecting equating methods based on form difficulty differences.

Purpose of the Study:

To investigate the impact of varying form difficulty differences on equating accuracy.
To compare the performance of different equating methods under distinct difficulty scenarios.
To provide evidence-based recommendations for equating method selection.

Main Methods:

Simulation study evaluating six equating methods.
Two common equating designs were examined: random group (RG) and common-item nonequivalent group (CINEG).
Conditions included varying levels of form difficulty difference, from none to large.

Main Results:

Under the RG design, mean equating excelled with no/small differences; equipercentile was superior for medium/large differences.
For CINEG design, Tucker Linear performed best with small/medium differences; chained equipercentile or frequency estimation was optimal for large differences.
Equating method performance is significantly influenced by the magnitude of form difficulty disparity.

Conclusions:

The study offers crucial guidance for selecting appropriate equating methods based on form difficulty.
Results inform testing companies on optimal equating strategies for similar and dissimilar test forms.
Accurate score comparability relies on matching equating methods to the degree of form difficulty difference.