
Test Reliability at the Individual Level.

Yueqin Hu (1), John R Nesselroade (2), Monica K Erbacher (3)

  • (1) Texas State University.

Structural Equation Modeling: A Multidisciplinary Journal | September 23, 2017
Summary
This summary is machine-generated.

This study introduces new methods for estimating test reliability at the level of the individual and finds that a test's reliability can vary substantially from person to person. These psychometric approaches suit changeable attributes that are measured repeatedly.

Keywords:
Individual Reliability, PANAS, Parallel Tests, SEM

Area of Science:

  • Psychometrics
  • Psychological Measurement
  • Statistics

Background:

  • Reliability is a crucial psychometric property of tests.
  • Existing methods often assume uniform reliability across individuals.
  • Individual differences in measurement error are often overlooked.
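
For orientation, the contrast between uniform and person-specific reliability can be written in classical test theory notation. This is a sketch under assumptions: the summary gives no formulas, and the symbols below (X, T, E, the person index i, and the occasion index t) are introduced here for illustration rather than taken from the article.

```latex
% Classical test theory: observed score = true score + error
X = T + E, \qquad
\rho_{XX'} = \frac{\sigma_T^2}{\sigma_T^2 + \sigma_E^2}

% Hypothetical person-specific analogue: variances are taken over
% repeated occasions t within person i, so each person has an own value
X_{it} = T_{it} + E_{it}, \qquad
\rho_i = \frac{\sigma_{T,i}^2}{\sigma_{T,i}^2 + \sigma_{E,i}^2}
```

Under the first expression a single coefficient describes everyone. Under the second, two people with the same spread of true scores but different error variances would have different reliabilities.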

Purpose of the Study:

  • To propose and evaluate methods for estimating test reliability at the individual level.
  • To investigate person-specific reliability using intraindividual variation.
  • To provide parallel forms of the Positive and Negative Affect Schedule (PANAS) for research.

Main Methods:

  • Developed two approaches: parallel tests and structural equation modeling (SEM).
  • Utilized intraindividual variation to estimate reliability for each person.
  • Conducted simulation studies and an empirical study with repeated measures on the PANAS.
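
The parallel-tests idea can be illustrated with a small simulation. The sketch below is one plausible reading, not the authors' implementation: it assumes that each person answers two parallel forms on many occasions and that the within-person correlation between the forms across occasions estimates that person's reliability. The function names, the error standard deviations, and the number of occasions are all hypothetical.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_person(n_occasions, true_sd, error_sd, rng):
    """Simulate one person's scores on two parallel forms over occasions.

    Both forms share the same occasion-specific true score and add
    independent errors with the same (person-specific) error SD.
    """
    form_a, form_b = [], []
    for _ in range(n_occasions):
        state = rng.gauss(0.0, true_sd)  # true score on this occasion
        form_a.append(state + rng.gauss(0.0, error_sd))
        form_b.append(state + rng.gauss(0.0, error_sd))
    return form_a, form_b


rng = random.Random(0)
TRUE_SD = 1.0
# Hypothetical persons who differ only in measurement-error SD.
error_sds = {"person_1": 0.5, "person_2": 1.0, "person_3": 2.0}

estimates = {}
for name, error_sd in error_sds.items():
    a, b = simulate_person(5000, TRUE_SD, error_sd, rng)
    estimates[name] = pearson(a, b)  # within-person parallel-forms correlation
    expected = TRUE_SD**2 / (TRUE_SD**2 + error_sd**2)
    print(f"{name}: estimated {estimates[name]:.2f}, simulated {expected:.2f}")
```

With many occasions, each person's estimate should land near the simulated value of true variance divided by total variance, and the three persons should come out with clearly different reliabilities even though they take the same test. Real repeated-measures data would have far fewer occasions per person, so estimates would be noisier than in this sketch.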

Main Results:

  • Both the parallel tests and SEM approaches successfully recovered simulated reliability coefficients.
  • The empirical study demonstrated substantial person-to-person variation in PANAS reliability estimates.
  • Reliability of the PANAS was not uniform across all individuals.

Conclusions:

  • Individualized reliability estimation is feasible and reveals significant inter-individual differences.
  • The proposed methods are applicable to tests measuring changeable attributes with repeated assessments.
  • Findings highlight the importance of considering person-specific reliability in psychometric evaluations.