Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Standard Error of the Mean01:13

Standard Error of the Mean

11.7K
The sampling variability of a statistic is defined as how much the statistic varies from one sample to another. The sampling variability of a statistic is typically measured by measuring its standard error.
The standard error of the mean is an example of a standard error. It is a unique standard deviation known as the standard deviation of the sampling distribution of the mean. The standard error of the mean is a statistic that calculates how correctly a sample distribution represents a...
11.7K
Calculating and Interpreting the Linear Correlation Coefficient01:11

Calculating and Interpreting the Linear Correlation Coefficient

7.6K
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:
7.6K
Margin of Error01:27

Margin of Error

6.8K
The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.
6.8K
Confidence Coefficient01:24

Confidence Coefficient

10.2K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
10.2K
Kendall's Coefficient of Concordance01:20

Kendall's Coefficient of Concordance

909
Kendall's Coefficient of Concordance (W), also known as Kendall's W, is a non-parametric statistical measure used to assess the agreement or concordance between multiple raters or judges when they rank a set of items. It is often used when you have ordinal data (ranks) and you want to see if there is consistency or consensus among the raters. It is widely applied in research areas such as psychology, medicine, and social sciences, where multiple judges are asked to rank or rate subjects...
909
Empirical Method to Interpret Standard Deviation01:09

Empirical Method to Interpret Standard Deviation

9.2K
The empirical rule, also known as the three-sigma rule, allows a statistician to interpret the standard deviation in a normally distributed dataset. The rule states that 68% of the data lies within one standard deviation from the mean, 95% lies within two standard deviations from the mean, and 99.7% lies within three standard deviations from the mean. Additionally, this rule is also called the 68-95-99.7 rule.
This rule is used widely in statistics to calculate the proportion of data values...
9.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Bias and precision in true-score estimation.

The British journal of mathematical and statistical psychology·2026
Same author

How to Estimate Intraclass Correlation Coefficients for Interrater Reliability from Planned Incomplete Data.

Multivariate behavioral research·2025
Same author

Interrater Reliability for Interdependent Social Network Data: A Generalizability Theory Approach.

Multivariate behavioral research·2025
Same author

Maximum Augmented Empirical Likelihood Estimation of Categorical Marginal Models for Large Sparse Contingency Tables.

Psychometrika·2023
Same author

Updated guidelines on selecting an intraclass correlation coefficient for interrater reliability, with applications to incomplete observational designs.

Psychological methods·2022
Same author

Advances in nonparametric item response theory for scale construction in quality-of-life research.

Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation·2021
Same journal

BAYESIAN MIXED MULTIDIMENSIONAL SCALING FOR AUDITORY PROCESSING.

Psychometrika·2026
Same journal

Testing linear hypotheses in repeated measures generalized linear models using external information.

Psychometrika·2026
Same journal

When Do Unifactorial Items Increase the Reliability?

Psychometrika·2026
Same journal

Longitudinal Designs for Diagnostic Models: Identification and Estimation.

Psychometrika·2026
Same journal

Modeling Rare Events and Nonmonotone Nonignorable Missingness of Time-Varying Outcomes and Predictors in Binary Time-Series Daily Diary Data: A Bayesian Selection Model.

Psychometrika·2026
Same journal

Revelle's Beta: The Wait Is Over-Computation Becomes Possible.

Psychometrika·2026
See all related articles

Related Experiment Video

Updated: Jan 6, 2026

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow
09:18

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow

Published on: December 15, 2023

3.4K

Standard Errors for Reliability Coefficients.

L Andries van der Ark1

  • 1Research Institute of Child Development and Education, https://ror.org/04dkp9463University of Amsterdam, Amsterdam, Netherlands.

Psychometrika
|September 30, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces new analytic standard errors for reliability analysis coefficients in psychometrics. These methods provide crucial measurement precision estimates, especially for discrete scores in behavioral science research.

Keywords:
Multinomial samplingreliability analysisreliability coefficientsstandard errorsstatistical software

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.7K
Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity
08:40

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Published on: June 12, 2019

7.8K

Related Experiment Videos

Last Updated: Jan 6, 2026

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow
09:18

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow

Published on: December 15, 2023

3.4K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.7K
Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity
08:40

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Published on: June 12, 2019

7.8K

Area of Science:

  • Psychometrics
  • Behavioral Sciences
  • Statistical Analysis

Background:

  • Reliability analysis is crucial in applied psychometrics for assessing item and scale scores.
  • Standard statistical software often lacks standard error calculations, despite their importance for measurement precision.
  • Existing methods for reliability analysis may not adequately address discrete score distributions common in behavioral sciences.

Purpose of the Study:

  • To develop and provide analytic nonparametric standard errors for reliability analysis coefficients.
  • To address the unavailability of standard errors in most statistical software packages.
  • To offer methods suitable for discrete score data prevalent in behavioral sciences.

Main Methods:

  • Derivation of standard errors under a multinomial sampling scheme for discrete scores.
  • Presentation of detailed derivations in appendices.
  • Development of R functions for computing standard errors, available via Open Science Framework.

Main Results:

  • Evaluated bias and variance of the derived standard errors using simulated item scores.
  • Assessed the coverage of Wald-based confidence intervals.
  • Found generally satisfactory bias, variance, and coverage for larger sample sizes and non-boundary parameter values.

Conclusions:

  • The proposed analytic nonparametric standard errors are a valuable addition to reliability analysis.
  • These methods enhance the assessment of measurement precision, particularly for discrete data.
  • The R functions provide accessible tools for researchers to implement these improved reliability analyses.