Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Fisher's Exact Test

Fisher's Exact Test

Fisher's exact test is a statistical significance test widely used to analyze 2x2 contingency tables, particularly in situations where sample sizes are small. Unlike the chi-squared test, which approximates P-values and assumes minimum expected frequencies of at least five in each cell, Fisher's exact test calculates the exact probability (P-value) of observing the data or more extreme results under the null hypothesis. This feature makes it especially valuable when the assumptions of...

McNemar's Test

McNemar's Test

McNemar's Test is a nonparametric statistical test used to determine if there is a significant difference in proportions between two related groups when the outcome is binary (e.g., yes/no, success/failure). It is beneficial when we have paired data, such as pre-test/post-test designs, where the same subjects are measured under two different conditions. The test is named after the statistician Quinn McNemar, who introduced it in 1947. It is commonly used in situations where subjects are...

Sample Size Calculation

Sample Size Calculation

Knowledge of the sample size is the first requirement to conduct random sampling or an experiment. The sample size is the total number of units, observations, or groups (in some cases) used to get the data to estimate a population parameter. As the name suggests, the sample size is that of the sample drawn from the population and differs from the population size.
The sample size for the given experiment or sampling effort is fundamental to any study design. Sample size decides the number of...

One-Way ANOVA: Unequal Sample Sizes

One-Way ANOVA: Unequal Sample Sizes

One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:

Accuracy and Errors in Hypothesis Testing

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...

Sign Test for Matched Pairs

Sign Test for Matched Pairs

The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Statistical models for Alzheimer's disease clinical trials: Lessons learned from the DIAN-TU Platform Trial.

Journal of Alzheimer's disease : JAD·2026

Same author

Penalized estimation of linear transformation models for interval-censored data with time-dependent covariates.

Statistical methods in medical research·2026

Same author

Impact of male genital tract infections on semen quality: a systematic review and meta-analysis.

Fertility and sterility·2026

Same author

Testing disease progression under the proportional reduction in decline in Alzheimer's disease studies.

Journal of applied statistics·2026

Same author

Likelihood ratio test for the disease progression model to measure saved time in Alzheimer's disease.

Statistical methods in medical research·2026

Same author

Assessing safety and efficacy in subpopulations in Alzheimer's disease clinical trials: contextualizing representativeness.

Alzheimer's & dementia (New York, N. Y.)·2025

Same journal

Asymptotic online FWER control for dependent test statistics.

Statistical methods in medical research·2026

Same journal

Regression analysis of misclassified current status data with potentially unknown test accuracy.

Statistical methods in medical research·2026

Same journal

Bayesian multivariate linear mixed-effects models with varied association structures.

Statistical methods in medical research·2026

Same journal

Inference about the ratio of age-standardized rates between two overlapping populations.

Statistical methods in medical research·2026

Same journal

A robust neural network with random effects for subject-specific prediction of clustered count data.

Statistical methods in medical research·2026

Same journal

A comparison of methods for designing hybrid type 2 cluster-randomized trials with continuous effectiveness and implementation endpoints.

Statistical methods in medical research·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 12, 2026

A Within-Subject Experimental Design using an Object Location Task in Rats

A Within-Subject Experimental Design using an Object Location Task in Rats

Published on: May 6, 2021

Sample size calculation for agreement between two raters with binary endpoints using exact tests.

¹Department of Environmental and Occupational Health, Epidemiology and Biostatistics Program, School of Community Health Sciences, University of Nevada Las Vegas, Las Vegas, NV, USA.

Statistical Methods in Medical Research

|November 19, 2016

Summary

This summary is machine-generated.

New exact methods improve sample size calculations for two-rater agreement studies. These approaches offer better type I error control than traditional asymptotic methods, ensuring more reliable results in clinical trials.

Keywords:

Agreement test exact test kappa coefficient sample size unconditional test

More Related Videos

A Protocol of Manual Tests to Measure Sensation and Pain in Humans

A Protocol of Manual Tests to Measure Sensation and Pain in Humans

Published on: December 19, 2016

Performing Permanent Distal Middle Cerebral with Common Carotid Artery Occlusion in Aged Rats to Study Cortical Ischemia with Sustained Disability

Performing Permanent Distal Middle Cerebral with Common Carotid Artery Occlusion in Aged Rats to Study Cortical Ischemia with Sustained Disability

Published on: February 23, 2016

Related Experiment Videos

Last Updated: Mar 12, 2026

A Within-Subject Experimental Design using an Object Location Task in Rats

A Within-Subject Experimental Design using an Object Location Task in Rats

Published on: May 6, 2021

A Protocol of Manual Tests to Measure Sensation and Pain in Humans

A Protocol of Manual Tests to Measure Sensation and Pain in Humans

Published on: December 19, 2016

Performing Permanent Distal Middle Cerebral with Common Carotid Artery Occlusion in Aged Rats to Study Cortical Ischemia with Sustained Disability

Performing Permanent Distal Middle Cerebral with Common Carotid Artery Occlusion in Aged Rats to Study Cortical Ischemia with Sustained Disability

Published on: February 23, 2016

Area of Science:

Biostatistics
Clinical Trial Design
Statistical Methods

Background:

Traditional sample size calculations for inter-rater agreement rely on asymptotic methods.
Asymptotic approaches may offer unreliable sample sizes due to poor type I error control.

Purpose of the Study:

To introduce novel exact sample size calculation methods for two-rater agreement studies with binary endpoints.
To enhance the reliability of sample size determination by controlling the type I error rate.

Main Methods:

Proposed two exact sample size calculation approaches: one based on maximization, and another on estimation and maximization.
Evaluated the power of the two exact approaches.

Main Results:

The exact approach based on estimation and maximization demonstrated superior power compared to the maximization-based approach.
The proposed exact methods provide better type I error control.

Conclusions:

Exact sample size calculation methods, particularly the estimation and maximization approach, are recommended for two-rater agreement studies.
These methods ensure more accurate and reliable sample sizes, crucial for clinical trial validity.