Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...

One-Way ANOVA: Unequal Sample Sizes

One-Way ANOVA: Unequal Sample Sizes

One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:

Identifying Statistically Significant Differences: The F-Test

Identifying Statistically Significant Differences: The F-Test

The F-test is used to compare two sample variances to each other or compare the sample variance to the population variance. It is used to decide whether an indeterminate error can explain the difference in their values. The underlying assumptions that allow the use of the F-test include the data set or sets are normally distributed, and the data sets are independent of each other. The test statistic F is calculated by dividing one variance by another. In other words, the square of one standard...

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

Testing a Claim about Standard Deviation

Testing a Claim about Standard Deviation

A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...

Comparing Experimental Results: Student's t-Test

Comparing Experimental Results: Student's t-Test

The t-test is a statistical method used to compare the sample mean with a population mean or compare two means from two data sets. The test statistic is calculated from the standard deviation, mean, and number of measurements in the data set at a selected confidence interval and then compared to a table of critical values at this confidence level. If the test statistic is smaller than the critical value, the null hypothesis is accepted. In this case, we state that the difference between the...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

AgrAbility Quality of Life Profile Transitions and Relationships with Independent Living and Working.

Journal of agromedicine·2026

Same author

The Illness Management and Recovery Scale: Adaptation and Validation Study of the Spanish Version.

Evaluation & the health professions·2026

Same author

The Dispositional Hope Scale in Spanish-speaking users of mental health services: validation and normative data.

BMC psychology·2026

Same author

Psychometric evaluation of California Verbal Learning Test second edition short form (CVLT-II SF) score validity in American Indian adults: The Strong Heart Study.

Neuropsychology·2025

Same author

Psychometric properties of the NIH Toolbox Cognition Battery composites in older adults at risk for Alzheimer's disease and related dementias: A systematic review.

Alzheimer's & dementia : the journal of the Alzheimer's Association·2025

Same author

Exploring AgrAbility Quality of Life Profiles.

Journal of agromedicine·2025

Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026

Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026

Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026

Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026

Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026

Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jun 6, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

差异性项目功能效果尺寸用于有效性信息使用.

W Holmes Finch¹, Maria Dolores Hidalgo Montesinos², Brian F French³

¹Ball State University, Muncie, IN, USA.

Educational and psychological measurement

|November 25, 2024

概括

此摘要是机器生成的。

效果大小有助于量化差异物品功能 (DIF) 大小. 在模拟研究中,日志概率比率和Mantle-Haenszel日志概率比率差异准确地确定了哪个评估具有更多的DIF.

关键词:

差异性项目的功能.效果大小效果大小的影响.的有效性有效性.

更多相关视频

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

A Two-interval Forced-choice Task for Multisensory Comparisons

A Two-interval Forced-choice Task for Multisensory Comparisons

Published on: November 9, 2018

相关实验视频

Last Updated: Jun 6, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

A Two-interval Forced-choice Task for Multisensory Comparisons

A Two-interval Forced-choice Task for Multisensory Comparisons

Published on: November 9, 2018

科学领域:

心理测量心理测量心理测量
教育测量教育的测量
统计分析统计分析

背景情况:

对差异性项目功能 (DIF) 的统计显著性测试缺乏大小解释.
效应大小对于理解检测到的DIF的实际意义至关重要.
对于DIF分析,存在各种效果大小测量和解释准则.

研究的目的:

为了比较DIF效应大小的表现,在量化和比较DIF的两个评估中进行量化和比较.
评估效果大小是否准确地捕捉了总体DIF,并识别了DIF较少的评估.
在各种模拟条件下识别可靠的DIF效应大小指标.

主要方法:

进行了一项模拟研究,操纵影响效果大小和DIF检测的因素.
对比了不同DIF效应大小指标的性能.
效果大小应用于真实数据集,以实践示例.

主要成果:

固定效应的日志概率比率 (Ln) 和曼特尔-汉泽尔日志概率比率的方差 (Mantel-Haenszel log odds ratio) 显示出高准确性.
这些措施有效地确定了哪项评估显示出更高的DIF金额.
几种效果大小在各种模拟场景中显示出可靠的性能.

结论:

建议使用日志概率比率和Mantle-Haenszel日志概率比率差异来量化DIF大小,并比较评估之间的DIF水平.
这些效应大小在DIF分析中提供了超出统计意义的宝贵见解.
进一步的研究应该集中在效应大小上,以提高对DIF大小的理解.