Updated: Feb 22, 2026

Test Reliability at the Individual Level.

Yueqin Hu¹, John R Nesselroade², Monica K Erbacher³

¹Texas State University.

Structural Equation Modeling: A Multidisciplinary Journal

September 23, 2017

Summary

This summary is machine-generated.

This study introduces two methods for estimating test reliability at the individual level and finds that reliability can vary substantially from person to person. The approaches apply to tests of changeable attributes that are measured repeatedly.

Keywords:

Individual Reliability; PANAS; Parallel Tests; SEM

Area of Science:

Psychometrics
Psychological Measurement
Statistics

Background:

Reliability is a crucial psychometric property of tests.
Existing methods often assume uniform reliability across individuals.
Individual differences in measurement error are often overlooked.
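
For orientation, the contrast between uniform and person-specific reliability can be written in classical test theory notation. This is a sketch under assumptions: the summary does not give the authors' formulas, and the symbols below (observed score X, true score T, error E, person index i, occasion index t) are introduced here for illustration only.

```latex
% Conventional (group-level) reliability: one coefficient for everyone
X = T + E, \qquad
\rho_{XX'} = \frac{\sigma_T^2}{\sigma_T^2 + \sigma_E^2}

% Person-specific reliability (assumed form): variances are taken
% over occasions t within person i, so each person has their own value
X_{it} = T_{it} + E_{it}, \qquad
\rho_i = \frac{\sigma_{T_i}^2}{\sigma_{T_i}^2 + \sigma_{E_i}^2}
```

Under this reading, two people taking the same test can have different reliabilities if their true-score or error variances over time differ.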

Purpose of the Study:

To propose and evaluate methods for estimating test reliability at the individual level.
To investigate person-specific reliability using intraindividual variation.
To provide parallel forms of the Positive and Negative Affect Schedule (PANAS) for research.

Main Methods:

Developed two approaches: parallel tests and structural equation modeling (SEM).
Utilized intraindividual variation to estimate reliability for each person.
Conducted simulation studies and an empirical study with repeated measures on the PANAS.
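
The parallel tests idea can be sketched as a small simulation. This is not the authors' implementation: it assumes the classical test theory result that the correlation between two parallel forms estimates reliability, and applies it within one person across occasions. The function names, the two hypothetical persons, and their standard deviations are all illustrative assumptions.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_person(n_occasions, true_sd, error_sd, rng):
    """Simulate one person's scores on two parallel forms over occasions.

    The true score varies across occasions (a changeable attribute);
    each form adds its own independent error with the same variance.
    """
    form_a, form_b = [], []
    for _ in range(n_occasions):
        true_score = rng.gauss(0.0, true_sd)
        form_a.append(true_score + rng.gauss(0.0, error_sd))
        form_b.append(true_score + rng.gauss(0.0, error_sd))
    return form_a, form_b


def true_reliability(true_sd, error_sd):
    """Reliability implied by the simulation parameters."""
    return true_sd ** 2 / (true_sd ** 2 + error_sd ** 2)


rng = random.Random(0)

# Hypothetical persons: same true-score variability, different error levels.
persons = {"person_1": (1.0, 0.5), "person_2": (1.0, 1.5)}

estimates = {}
for name, (true_sd, error_sd) in persons.items():
    a, b = simulate_person(5000, true_sd, error_sd, rng)
    # Within-person correlation of the two forms across occasions
    estimates[name] = pearson(a, b)
    print(name, round(estimates[name], 3),
          "target", round(true_reliability(true_sd, error_sd), 3))
```

With many occasions per person the estimates should land close to the simulated values, and the person with larger error variance should show lower reliability. Real repeated-measures data would have far fewer occasions, so estimates would be much noisier than in this sketch.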

Main Results:

Both the parallel tests and SEM approaches successfully recovered simulated reliability coefficients.
The empirical study demonstrated substantial person-to-person variation in PANAS reliability estimates.
Reliability of the PANAS was not uniform across all individuals.

Conclusions:

Individualized reliability estimation is feasible and reveals substantial inter-individual differences in reliability.
The proposed methods are applicable to tests measuring changeable attributes with repeated assessments.
Findings highlight the importance of considering person-specific reliability in psychometric evaluations.