Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Standard Error of the Mean

Standard Error of the Mean

The sampling variability of a statistic is defined as how much the statistic varies from one sample to another. The sampling variability of a statistic is typically measured by measuring its standard error.
The standard error of the mean is an example of a standard error. It is a unique standard deviation known as the standard deviation of the sampling distribution of the mean. The standard error of the mean is a statistic that calculates how correctly a sample distribution represents a...

Calculating and Interpreting the Linear Correlation Coefficient

Calculating and Interpreting the Linear Correlation Coefficient

The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable, x, and the dependent variable, y. Hence, it is also known as the Pearson product-moment correlation coefficient. It can be calculated using the following equation:

Margin of Error

Margin of Error

The margin of error is also called the maximum error of an estimate. The margin of error is the maximum possible or expected difference between the observed sample parameter value and the actual population parameter value. For proportion, it is the maximum difference between the value of sample proportion obtained from the data and the true value of population proportion. As the true value of the population parameter is not known, the margin of error is calculated using the sample statistic.

Confidence Coefficient

Confidence Coefficient

The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...

Kendall's Coefficient of Concordance

Kendall's Coefficient of Concordance

Kendall's Coefficient of Concordance (W), also known as Kendall's W, is a non-parametric statistical measure used to assess the agreement or concordance between multiple raters or judges when they rank a set of items. It is often used when you have ordinal data (ranks) and you want to see if there is consistency or consensus among the raters. It is widely applied in research areas such as psychology, medicine, and social sciences, where multiple judges are asked to rank or rate subjects...

Empirical Method to Interpret Standard Deviation

Empirical Method to Interpret Standard Deviation

The empirical rule, also known as the three-sigma rule, allows a statistician to interpret the standard deviation in a normally distributed dataset. The rule states that 68% of the data lies within one standard deviation from the mean, 95% lies within two standard deviations from the mean, and 99.7% lies within three standard deviations from the mean. Additionally, this rule is also called the 68-95-99.7 rule.
This rule is used widely in statistics to calculate the proportion of data values...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Bias and precision in true-score estimation.

The British journal of mathematical and statistical psychology·2026

Same author

How to Estimate Intraclass Correlation Coefficients for Interrater Reliability from Planned Incomplete Data.

Multivariate behavioral research·2025

Same author

Interrater Reliability for Interdependent Social Network Data: A Generalizability Theory Approach.

Multivariate behavioral research·2025

Same author

Maximum Augmented Empirical Likelihood Estimation of Categorical Marginal Models for Large Sparse Contingency Tables.

Psychometrika·2023

Same author

Updated guidelines on selecting an intraclass correlation coefficient for interrater reliability, with applications to incomplete observational designs.

Psychological methods·2022

Same author

Advances in nonparametric item response theory for scale construction in quality-of-life research.

Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation·2021

Same journal

BAYESIAN MIXED MULTIDIMENSIONAL SCALING FOR AUDITORY PROCESSING.

Psychometrika·2026

Same journal

Testing linear hypotheses in repeated measures generalized linear models using external information.

Psychometrika·2026

Same journal

When Do Unifactorial Items Increase the Reliability?

Psychometrika·2026

Same journal

Longitudinal Designs for Diagnostic Models: Identification and Estimation.

Psychometrika·2026

Same journal

Modeling Rare Events and Nonmonotone Nonignorable Missingness of Time-Varying Outcomes and Predictors in Binary Time-Series Daily Diary Data: A Bayesian Selection Model.

Psychometrika·2026

Same journal

Revelle's Beta: The Wait Is Over-Computation Becomes Possible.

Psychometrika·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 6, 2026

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow

Published on: December 15, 2023

Standard Errors for Reliability Coefficients.

L Andries van der Ark¹

¹Research Institute of Child Development and Education, https://ror.org/04dkp9463University of Amsterdam, Amsterdam, Netherlands.

|September 30, 2025

Summary

This summary is machine-generated.

This study introduces new analytic standard errors for reliability analysis coefficients in psychometrics. These methods provide crucial measurement precision estimates, especially for discrete scores in behavioral science research.

Keywords:

Multinomial sampling reliability analysis reliability coefficients standard errors statistical software

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Published on: June 12, 2019

Related Experiment Videos

Last Updated: Jan 6, 2026

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow

Author Spotlight: Assessing the Reliability of Doppler Ultrasound in Measuring Leg Blood Flow

Published on: December 15, 2023

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Isokinetic Robotic Device to Improve Test-Retest and Inter-Rater Reliability for Stretch Reflex Measurements in Stroke Patients with Spasticity

Published on: June 12, 2019

Area of Science:

Psychometrics
Behavioral Sciences
Statistical Analysis

Background:

Reliability analysis is crucial in applied psychometrics for assessing item and scale scores.
Standard statistical software often lacks standard error calculations, despite their importance for measurement precision.
Existing methods for reliability analysis may not adequately address discrete score distributions common in behavioral sciences.

Purpose of the Study:

To develop and provide analytic nonparametric standard errors for reliability analysis coefficients.
To address the unavailability of standard errors in most statistical software packages.
To offer methods suitable for discrete score data prevalent in behavioral sciences.

Main Methods:

Derivation of standard errors under a multinomial sampling scheme for discrete scores.
Presentation of detailed derivations in appendices.
Development of R functions for computing standard errors, available via Open Science Framework.

Main Results:

Evaluated bias and variance of the derived standard errors using simulated item scores.
Assessed the coverage of Wald-based confidence intervals.
Found generally satisfactory bias, variance, and coverage for larger sample sizes and non-boundary parameter values.

Conclusions:

The proposed analytic nonparametric standard errors are a valuable addition to reliability analysis.
These methods enhance the assessment of measurement precision, particularly for discrete data.
The R functions provide accessible tools for researchers to implement these improved reliability analyses.