Reliability and Model Fit.

Leanne M Stanley¹, Michael C Edwards¹

¹The Ohio State University, Columbus, OH, USA.

Educational and Psychological Measurement

May 26, 2018

Summary

This summary is machine-generated.

Test score reliability and psychometric model fit are distinct properties, and assessing both is crucial for validity. Discrepancies between the two offer valuable insight into how test scores should be interpreted and used.

Keywords:

factor analysis, item response theory, model fit, reliability

Area of Science:

Psychometrics
Educational Measurement
Health Outcomes Research

Background:

Test score validity relies on both reliability and measurement model fit.
Investigators often assume reliability and model fit are simultaneously acceptable or unacceptable.
This study explores scenarios where model fit is adequate, but reliability is not.
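
A hedged worked example, not taken from the study, shows how adequate fit and poor reliability can coexist. Assume $k$ standardized items that follow a one-factor model exactly, each with the same loading $\lambda$ (both symbols and the numeric values are illustrative assumptions). The model then fits perfectly in the population, yet the reliability of the sum score depends only on $k$ and $\lambda$:

```latex
\rho_{XX'} \;=\; \frac{k^{2}\lambda^{2}}{k^{2}\lambda^{2} + k\,(1-\lambda^{2})}
          \;=\; \frac{k\lambda^{2}}{k\lambda^{2} + 1 - \lambda^{2}},
\qquad
k = 5:\quad
\lambda = 0.3 \;\Rightarrow\; \rho_{XX'} = \frac{0.45}{1.36} \approx 0.33,
\qquad
\lambda = 0.8 \;\Rightarrow\; \rho_{XX'} = \frac{3.20}{3.56} \approx 0.90.
```

Under these assumptions, a correctly specified model with weak loadings still produces scores too imprecise for most uses, so fit alone says little about score precision.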

Purpose of the Study:

To differentiate between test score reliability and psychometric model fit.
To emphasize the importance of evaluating both for score validity.
To examine situations with acceptable model fit but unacceptable reliability.

Main Methods:

Data were simulated based on the Patient-Reported Outcomes Measurement Information System (PROMIS) anxiety item bank.
Analysis using Classical Test Theory, Factor Analysis, and Item Response Theory.
Application of analytic techniques from diverse psychometric traditions.
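
As a rough illustration of the Classical Test Theory side of such an analysis, the sketch below computes coefficient (Cronbach's) alpha on simulated item scores. It is not the authors' simulation: the generator `simulate_one_factor`, the continuous one-factor design, the sample size, and the loadings 0.3 and 0.8 are all hypothetical choices made here for illustration, whereas the study simulated from the PROMIS anxiety item bank.

```python
import numpy as np


def cronbach_alpha(scores: np.ndarray) -> float:
    """Coefficient alpha for an (n_persons, n_items) score matrix."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1)       # variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)   # variance of the sum score
    return k / (k - 1) * (1.0 - item_vars.sum() / total_var)


def simulate_one_factor(n_persons, n_items, loading, rng):
    """Hypothetical generator: standardized items driven by one common factor.

    Every item has the same loading, so a one-factor model is correctly
    specified for these data by construction.
    """
    factor = rng.standard_normal((n_persons, 1))
    noise = rng.standard_normal((n_persons, n_items))
    return loading * factor + np.sqrt(1.0 - loading**2) * noise


rng = np.random.default_rng(0)

# Same model structure in both cases; only the strength of the loadings differs.
weak = simulate_one_factor(5000, 5, 0.3, rng)    # assumed weak loadings
strong = simulate_one_factor(5000, 5, 0.8, rng)  # assumed strong loadings

alpha_weak = cronbach_alpha(weak)
alpha_strong = cronbach_alpha(strong)

print(f"alpha (weak loadings):   {alpha_weak:.2f}")
print(f"alpha (strong loadings): {alpha_strong:.2f}")
```

Both datasets are generated by the same one-factor structure, so a one-factor model is appropriate for each; only the weak-loading case should yield an alpha well below conventional thresholds.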

Main Results:

Reliability and model fit are demonstrably distinct psychometric properties.
Disagreement between reliability and model fit indices provides unique information.
This information is valuable for validity arguments, irrespective of data analysis methods.

Conclusions:

The assessment of both reliability and model fit yields critical information for validity.
Understanding the distinction is vital for appropriate interpretation and use of test scores.
Discrepancies highlight areas needing further investigation in psychometric evaluations.