Detecting Uniform Differential Item Functioning for Continuous Response Computerized Adaptive Testing

Chun Wang1, Ruoyi Zhu1

  • 1University of Washington, WA, USA.

Applied Psychological Measurement | February 8, 2024
Summary
This summary is machine-generated.

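We developed two methods for detecting differential item functioning (DIF) in computerized adaptive testing (CAT) with continuous responses and sparse data. Both methods effectively identify uniform DIF, supporting fair measurement in advanced testing scenarios.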

Keywords:
SIBTEST; computerized adaptive testing; continuous response; differential item functioning.

Scientific fields:

  • Psychometrics
  • Educational measurement
  • Computerized adaptive testing (CAT)

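Background:

  • Ensuring measurement fairness requires evaluating items for differential item functioning (DIF).
  • Continuous-response items provide more information than dichotomous items, particularly in performance-based tasks.
  • In computerized adaptive testing (CAT), severe data sparsity is common when items are machine-generated.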

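Purpose of the study:

  • Propose and evaluate two new methods for detecting uniform DIF in a specific setting: continuous-response, severely sparse CAT.
  • Assess how effectively these methods identify DIF under challenging data conditions.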

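Main methods:

  • A modified, nonparametric CAT-SIBTEST method that does not depend on item response theory (IRT) model assumptions.
  • A parametric, model-based regularization method.
  • Simulation studies to evaluate the performance of both methods.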
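
The idea behind a SIBTEST-style check for uniform DIF on one continuous item can be illustrated with a small sketch. It is not the authors' modified CAT-SIBTEST: it omits the regression correction used by SIBTEST-type procedures and any handling specific to severe sparsity. The function name `stratified_dif_index`, the equal-width strata on ability estimates, the pooled-count weights, and the `min_per_group` cutoff are all assumptions made for illustration. Examinees are matched on their estimated ability, the mean item score of the reference and focal groups is compared within each stratum, and the weighted differences are summed.

```python
from math import sqrt
from statistics import mean, variance


def stratified_dif_index(theta, group, score, n_strata=5, min_per_group=2):
    """Simplified SIBTEST-style index for one continuous item (illustrative only).

    theta: ability estimates used to match examinees (hypothetical input)
    group: "ref" or "foc" label for each examinee
    score: continuous item response for each examinee
    Returns (beta, se): weighted mean difference (reference minus focal)
    and its approximate standard error.
    """
    lo, hi = min(theta), max(theta)
    width = (hi - lo) / n_strata
    if width == 0:
        width = 1.0  # all abilities equal: everyone lands in stratum 0

    # Assign each examinee to an equal-width ability stratum.
    strata = {}
    for t, g, y in zip(theta, group, score):
        k = min(int((t - lo) / width), n_strata - 1)
        strata.setdefault(k, {"ref": [], "foc": []})[g].append(y)

    # Keep only strata with enough examinees in both groups
    # (sparse data can leave many strata unusable).
    usable = [
        s for s in strata.values()
        if len(s["ref"]) >= min_per_group and len(s["foc"]) >= min_per_group
    ]
    if not usable:
        raise ValueError("no stratum has enough examinees in both groups")

    total = sum(len(s["ref"]) + len(s["foc"]) for s in usable)
    beta, var_beta = 0.0, 0.0
    for s in usable:
        w = (len(s["ref"]) + len(s["foc"])) / total  # pooled-count weight
        beta += w * (mean(s["ref"]) - mean(s["foc"]))
        var_beta += w ** 2 * (
            variance(s["ref"]) / len(s["ref"])
            + variance(s["foc"]) / len(s["foc"])
        )
    return beta, sqrt(var_beta)


# Toy data (made up): the focal group scores 0.2 lower at every ability level.
theta, group, score = [], [], []
for t in [-2, -1, 0, 1, 2]:
    base = 0.5 + 0.1 * t
    for noise in (0.05, -0.05):
        theta += [t, t]
        group += ["ref", "foc"]
        score += [base + noise, base - 0.2 + noise]

beta, se = stratified_dif_index(theta, group, score)
print(f"beta = {beta:.3f}, se = {se:.4f}, z = {beta / se:.2f}")
```

A large absolute value of `beta / se` suggests that the item favors one group at matched ability levels. Under uniform DIF the within-stratum differences share the same sign, so they accumulate in the weighted sum rather than cancel.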

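Main results:

  • Both proposed methods were effective in correctly identifying items that exhibit uniform DIF.
  • Simulation studies confirmed the stability of the techniques developed for this specific CAT scenario.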

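Conclusions:

  • The modified CAT-SIBTEST and the regularization method are suitable for detecting uniform DIF in continuous-response, severely sparse CAT.
  • These methods help ensure measurement fairness in advanced, data-intensive testing environments.
  • A real-data analysis is provided to illustrate the practical application and potential limitations.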