Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Induced-fit Model01:13

Induced-fit Model

89.5K
Most chemical reactions in cells require enzymes—biological catalysts that speed up the reaction without being consumed or permanently changed. They reduce the activation energy needed to convert the reactants into products. Enzymes are proteins, that usually work by binding to a substrate—a reactant molecule that they act upon.
Enzymes exhibit substrate specificity, meaning that they can only bind to certain substrates. This is mainly determined by the shape and chemical...
89.5K
Reliability and Validity01:29

Reliability and Validity

14.1K
Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
14.1K
Inclusive Fitness00:57

Inclusive Fitness

42.1K
Most altruistic behavior—in which one animal helps another at a cost to themselves—occurs between relatives. Scientists think these altruistic behaviors evolved because they increase the inclusive fitness of the animal providing help.
42.1K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

9.3K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
9.3K
Distribution Reliability and Automation01:25

Distribution Reliability and Automation

519
Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...
519
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

8.7K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
8.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Applying the Argument-Based Approach to Validation With Clinical Outcome Assessments: Strategies for Constructing a Rationale.

Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research·2026
Same author

Development and psychometric evaluation of a new self-report measure to assess patient engagement behaviours and capacity in the USA: the Patient Engagement Capacity Survey.

BMJ open·2025
Same author

The hf-PGA is a valid and reliable measure of hand/foot psoriasis severity in adults: results from a phase 2b clinical trial.

The Journal of dermatological treatment·2025
Same author

Evaluating When Subscores Add Value in Psychological and Health Applications.

Assessment·2024
Same author

Toward better outcome measurement for insomnia in children with autism spectrum disorder.

Autism : the international journal of research and practice·2024
Same author

Measurement Invariance in Intellectual and Developmental Disability Research.

American journal on intellectual and developmental disabilities·2024
Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026
Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026
Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026
Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026
Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026
Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026
See all related articles

Related Experiment Video

Updated: Feb 10, 2026

A Quantitative Fitness Analysis Workflow
11:39

A Quantitative Fitness Analysis Workflow

Published on: August 13, 2012

15.0K

Reliability and Model Fit.

Leanne M Stanley1, Michael C Edwards1

  • 1The Ohio State University, Columbus, OH, USA.

Educational and Psychological Measurement
|May 26, 2018
PubMed
Summary
This summary is machine-generated.

Assessing test score reliability and psychometric model fit are distinct but crucial for validity. Discrepancies between reliability and model fit offer valuable insights for interpreting test score uses.

Keywords:
factor analysisitem response theorymodel fitreliability

More Related Videos

Evaluation of a Reliable Biomarker in a Cecal Ligation and Puncture-Induced Mouse Model of Sepsis
05:28

Evaluation of a Reliable Biomarker in a Cecal Ligation and Puncture-Induced Mouse Model of Sepsis

Published on: December 9, 2022

4.5K
Imaging In-Stent Restenosis: An Inexpensive, Reliable, and Rapid Preclinical Model
09:46

Imaging In-Stent Restenosis: An Inexpensive, Reliable, and Rapid Preclinical Model

Published on: September 14, 2009

14.3K

Related Experiment Videos

Last Updated: Feb 10, 2026

A Quantitative Fitness Analysis Workflow
11:39

A Quantitative Fitness Analysis Workflow

Published on: August 13, 2012

15.0K
Evaluation of a Reliable Biomarker in a Cecal Ligation and Puncture-Induced Mouse Model of Sepsis
05:28

Evaluation of a Reliable Biomarker in a Cecal Ligation and Puncture-Induced Mouse Model of Sepsis

Published on: December 9, 2022

4.5K
Imaging In-Stent Restenosis: An Inexpensive, Reliable, and Rapid Preclinical Model
09:46

Imaging In-Stent Restenosis: An Inexpensive, Reliable, and Rapid Preclinical Model

Published on: September 14, 2009

14.3K

Area of Science:

  • Psychometrics
  • Educational Measurement
  • Health Outcomes Research

Background:

  • Test score validity relies on both reliability and measurement model fit.
  • Investigators often assume reliability and model fit are simultaneously acceptable or unacceptable.
  • This study explores scenarios where model fit is adequate, but reliability is not.

Purpose of the Study:

  • To differentiate between test score reliability and psychometric model fit.
  • To emphasize the importance of evaluating both for score validity.
  • To examine situations with acceptable model fit but unacceptable reliability.

Main Methods:

  • Simulated data based on Patient Reported Outcomes Measurement Information System (PROMIS) anxiety item bank.
  • Analysis using Classical Test Theory, Factor Analysis, and Item Response Theory.
  • Application of analytic techniques from diverse psychometric traditions.

Main Results:

  • Reliability and model fit are demonstrably distinct psychometric properties.
  • Disagreement between reliability and model fit indices provides unique information.
  • This information is valuable for validity arguments, irrespective of data analysis methods.

Conclusions:

  • The assessment of both reliability and model fit yields critical information for validity.
  • Understanding the distinction is vital for appropriate interpretation and use of test scores.
  • Discrepancies highlight areas needing further investigation in psychometric evaluations.