Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design

  • 0College of Mathematics and System Science, 47907 Xinjiang University , Urumqi, China.
The international journal of biostatistics +

|

Abstract

Gwet's first-order agreement coefficient (AC 1) is widely used to evaluate the consistency between raters. Considering the existence of a certain relationship between the raters, the paper aims to test the equality of response rates and the dependency between two raters of modified AC 1's in a stratified design and estimates the sample size for a given significance level. We first establish a probability model and then estimate the unknown parameters. Further, we explore the homogeneity test of these AC 1's under the asymptotic method, such as likelihood ratio, score, and Wald-type statistics. In numerical simulation, the performance of statistics is investigated in terms of type I error rates (TIEs) and power while finding a suitable sample size under a given power. The results show that the Wald-type statistic has robust TIEs and satisfactory power and is suitable for large samples (n≥50). Under the same power, the sample size of the Wald-type test is smaller when the number of strata is large. The higher the power, the larger the required sample size. Finally, two real examples are given to illustrate these methods.

Related Concept Videos

Test for Homogeneity 01:23

1.9K

The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...

One-Way ANOVA: Equal Sample Sizes 01:15

3.1K

One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...

One-Way ANOVA: Unequal Sample Sizes 01:15

5.6K

One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:

In the equation, n is the sample size,  ͞x is the sample mean, x̿  is the combined mean for all the observations, k  is the number of samples, and s2 is the variance of the sample. It should be noted that the subscript 'i'...

Sample Proportion and Population Proportion 01:20

5.1K

Collecting samples or responses from an entire population takes significant time and effort, so a researcher collects responses from only a sample of that population. Suppose a study needs to collect information about a specific mobile application. After sample collection, the researcher analyzes the data and discovers that most individuals in the sample use that specific mobile application. The sample proportion measures the number of individuals in a sample who either use or don't use the...

Goodness-of-Fit Test 01:16

3.2K

The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is...

Wilcoxon Rank-Sum Test 01:21

105

The Wilcoxon rank-sum test, also known as the Mann-Whitney U test, is a nonparametric test used to determine if there is a significant difference between the distributions of two independent samples. This test is designed specifically for two independent populations and has the following key requirements:

The samples must be randomly drawn.
The data should be ordinal or capable of being converted to an ordinal scale, allowing the values to be ordered and ranked.

The null hypothesis is that...