Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Confidence Coefficient01:24

Confidence Coefficient

7.5K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
7.5K
Critical Values01:31

Critical Values

6.7K
A critical value is a definite value obtained from a particular probability distribution at a predecided confidence level (or a predecided significance level) for a given population parameter. The critical value provides demarcation that separates the sample statistics that are likely to occur from the ones that are unlikely to occur based on the given probability distribution and the population parameter to be estimated. The critical value for normal distribution is obtained from the z...
6.7K
Bonferroni Test01:10

Bonferroni Test

2.7K
The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...
2.7K
Statistical Analysis: Overview01:11

Statistical Analysis: Overview

6.0K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
6.0K
z Scores and Area Under the Curve01:17

z Scores and Area Under the Curve

10.4K
z scores are the standardized values obtained after converting a normal distribution into a standard normal distribution. A z score is measured in units of the standard deviation. The z score tells you how many standard deviations the value x is above (to the right of) or below (to the left of) the mean, μ. Values of x that are larger than the mean have positive z scores, and values of x that are smaller than the mean have negative z scores. If x equals the mean, then x has a z score of...
10.4K
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.2K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same authorSame journal

Exploring psychological tradeoffs: Developing and demonstrating an R Shiny app for Pareto optimization.

Behavior research methods·2026
Same author

Early Screening for Decoding- and Language-Related Reading Difficulties in First and Third Grades.

Assessment for effective intervention : official journal of the Council for Educational Diagnostic Services·2026
Same author

Adolescent Sexting in Romantic Relationships and Daily Positive and Negative Affect Dynamics: A Dyadic Intensive Longitudinal Study.

Computers in human behavior·2026
Same author

A primer on intensive longitudinal psychometrics.

Behavior research methods·2026
Same author

Dynamic measurement invariance cutoffs for longitudinal and dyadic data.

Behavior research methods·2026
Same author

Evidence-based practice attitude scale for Latinx mental health professionals: a novel application of confirmatory factor analysis.

Implementation science communications·2026
Same journal

The performance of Bayesian fit measures in detecting misspecified multilevel structural equation modeling.

Behavior research methods·2026
Same journal

Psychometric functions from multiple responses : Dedicated to the memory of Colin L. Mallows.

Behavior research methods·2026
Same journal

Low-cost, open-source, full-stack software and Arduino-based hardware for control of commercially available animal behavior systems.

Behavior research methods·2026
Same journal

PyNeon: A Python package for the analysis of Neon multimodal mobile eye-tracking data.

Behavior research methods·2026
Same journal

Talking surveys: How photorealistic embodied conversational agents shape response quality, engagement, and satisfaction.

Behavior research methods·2026
See all related articles

Related Experiment Video

Updated: May 28, 2025

The α-test: Rapid Cell-free CD4 Enumeration Using Whole Saliva
11:22

The α-test: Rapid Cell-free CD4 Enumeration Using Whole Saliva

Published on: May 16, 2012

13.8K

Reliability representativeness: How well does coefficient alpha summarize reliability across the score distribution?

Daniel McNeish1, Denis Dumas2

  • 1Department of Psychology, Arizona State University, PO Box 871104, Tempe, AZ, 85287, USA. dmcneish@asu.edu.

Behavior Research Methods
|February 10, 2025
PubMed
Summary
This summary is machine-generated.

Coefficient alpha provides a single reliability score, but reliability can vary across a scale. This study introduces methods to compare coefficient alpha with conditional reliability, offering a clearer understanding of score precision.

Keywords:
Coefficient alphaConditional reliabilityCronbach's alphaOmegaReliability

More Related Videos

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

670
Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity
08:40

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Published on: June 12, 2019

7.4K

Related Experiment Videos

Last Updated: May 28, 2025

The α-test: Rapid Cell-free CD4 Enumeration Using Whole Saliva
11:22

The α-test: Rapid Cell-free CD4 Enumeration Using Whole Saliva

Published on: May 16, 2012

13.8K
Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

670
Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity
08:40

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Published on: June 12, 2019

7.4K

Area of Science:

  • Psychometrics
  • Psychological Measurement
  • Statistical Modeling

Background:

  • Psychological scales commonly report reliability using coefficient alpha, which assumes uniform reliability across all score levels.
  • However, reliability can be conditional, varying across the score distribution, a concept well-established in Item Response Theory (IRT) but less understood by many psychologists.
  • The representativeness of a single reliability index like alpha can be misleading when reliability differs significantly across score ranges.

Purpose of the Study:

  • To address the limitations of coefficient alpha by exploring conditional reliability.
  • To develop and present methods, an R package, and a Shiny application for quantifying differences between coefficient alpha and conditional reliability.
  • To enable psychologists to better contextualize and interpret the reliability of their scale scores.

Main Methods:

  • Development of a novel statistical method to assess conditional reliability across the score distribution.
  • Implementation of this method within a user-friendly R package.
  • Creation of an interactive Shiny application for visualizing and comparing coefficient alpha with conditional reliability estimates.

Main Results:

  • Demonstration that coefficient alpha may be unrepresentative when conditional reliability is heterogeneous across the score distribution.
  • Quantification of potential discrepancies between a global reliability index and reliability at specific score points, such as cut-offs.
  • Validation of the proposed methods and tools for practical application in psychological research.

Conclusions:

  • Coefficient alpha may not always accurately reflect the reliability of specific scores, especially at critical decision points.
  • The developed tools facilitate a more nuanced understanding of scale reliability beyond a single summary statistic.
  • Psychologists are encouraged to consider conditional reliability for a more comprehensive evaluation of measurement precision.