Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Test for Homogeneity

Test for Homogeneity

The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can be stated as...

Multiple Comparison Tests

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...

The Anderson-Darling Test

The Anderson-Darling Test

The Anderson-Darling test is a statistical method used to determine whether a data sample is likely drawn from a specific theoretical distribution. Unlike parametric tests, it does not require assumptions about specific parameters of the distribution. Instead, it compares the sample's empirical cumulative distribution function (ECDF) with the cumulative distribution function (CDF) of the hypothesized distribution. Critical values for the test are specific to the chosen distribution rather than...

Wald-Wolfowitz Runs Test I

Wald-Wolfowitz Runs Test I

The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with data...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Detecting Test Speededness Using Responses and/or Response Times: Change Point Analysis Approaches Based on Schwarz Information Criterion.

Psychometrika·2026

Same author

Using multilabel classification neural network to detect intersectional DIF with small sample sizes.

The British journal of mathematical and statistical psychology·2026

Same author

A multi-strategy cognitive diagnosis model based on response times and fixation counts.

Behavior research methods·2026

Same author

A Diagnostic Facet Status Model (DFSM) for Extracting Instructionally Useful Information from Diagnostic Assessment.

Psychometrika·2026

Same author

Calibrating Multidimensional Assessments With Structural Missingness: An Application of a Multiple-Group Higher-Order IRT Model.

Applied psychological measurement·2026

Same author

Robot-Assisted Dynamic Interaction of Hemiplegic Upper Limbs with Complex Objects Based on Enhanced Feedforward-Impedance Control.

Biomimetics (Basel, Switzerland)·2025

Same journal

babebi: An R Package for Bayesian Estimation and Validation in Small-N Two-Rater Pre-Post Designs.

Applied psychological measurement·2026

Same journal

A Tool for Agreement and Alignment Analysis in Binary Rating Tasks: The R Package scindex.

Applied psychological measurement·2026

Same journal

The EM Algorithm and Its Variants in Cognitive Diagnostic Models: Comparing Their Propensity for Boundaries, Extremes, Convergence, and Suboptimal Solutions.

Applied psychological measurement·2026

Same journal

When Perceptions of Social Desirability Differ: Implications for the Multidimensional Nominal Response Model of Faking.

Applied psychological measurement·2026

Same journal

csemGT: An R Package for Estimating Raw-Score Conditional Standard Errors of Measurement in Generalizability Theory.

Applied psychological measurement·2026

Same journal

Confirmatory Factor Analysis with Adaptive Quadrature Estimator Using Four Link Functions.

Applied psychological measurement·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: May 14, 2026

A Tactile Automated Passive-Finger Stimulator TAPS

A Tactile Automated Passive-Finger Stimulator TAPS

Published on: June 3, 2009

检测连续响应的统一差异性项目功能,进行计算机化自适应测试.

Chun Wang¹, Ruoyi Zhu¹

¹University of Washington, WA, USA.

Applied psychological measurement

|February 8, 2024

概括

此摘要是机器生成的。

我们开发了两种方法来检测在计算机化适应性测试 (CAT) 中的差异性项目功能 (DIF),使用连续的响应和稀疏的数据. 这两种方法都有效地确定了统一的DIF,确保在先进的测试场景中进行公平的测量.

关键词:

第七个测试计算机化的适应性测试.连续响应连续响应.差异性项目的功能.

更多相关视频

Computerized Adaptive Testing System of Functional Assessment of Stroke

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

相关实验视频

Last Updated: May 14, 2026

A Tactile Automated Passive-Finger Stimulator TAPS

A Tactile Automated Passive-Finger Stimulator TAPS

Published on: June 3, 2009

Computerized Adaptive Testing System of Functional Assessment of Stroke

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

科学领域:

心理测量心理测量心理测量
教育测量教育的测量
计算机化的适应性测试 (CAT)

背景情况:

确保衡量的公平性需要对差异性项目功能 (DIF) 的项目进行评估.
连续响应项目提供了比二分类项目更多的信息,特别是在基于绩效的任务中.
在计算机自适应测试 (CAT) 中,当项目是机器生成的时,严重的数据稀疏性是常见的.

研究的目的:

提出和评估两种新的方法来检测统一的DIF在特定的背景下持续响应,严重稀疏的CAT.
评估这些方法在具有挑战性的数据条件下识别DIF的有效性.

主要方法:

一种修改的非参数CAT-SIBTEST方法,独立于项目响应理论 (IRT) 模型假设.
一种参数,基于模型的规范化方法.
进行模拟研究以评估方法性能.

主要成果:

两种拟议的方法都在准确识别展示均DIF的物品方面表现出有效性.
模拟研究证实了在特定的CAT场景中开发的技术的稳定性.

结论:

开发的CAT-SIBTEST修改和规范化方法适用于检测连续响应中均的DIF,严重稀疏的CAT.
这些方法有助于在先进的,数据密集型测试环境中确保测量公平性.
提供真实数据分析,以说明实际应用和潜在的限制.