Jove
Visualize
お問い合わせ
JoVE
x logofacebook logolinkedin logoyoutube logo
JoVEについて
概要リーダーシップブログJoVEヘルプセンター
著者向け
出版プロセス編集委員会範囲と方針査読よくある質問投稿
図書館員向け
推薦の声購読アクセスリソース図書館諮問委員会よくある質問
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experimentsアーカイブ
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教員リソースセンター教員サイト
利用規約
プライバシーポリシー
ポリシー

関連する概念動画

Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

522
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
522
Testing a Claim about Standard Deviation01:19

Testing a Claim about Standard Deviation

3.0K
A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...
3.0K
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

6.8K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
6.8K
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

4.2K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
4.2K
Ratio Level of Measurement00:54

Ratio Level of Measurement

21.8K
The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
A set of data measured using the ratio scale takes care of the ratio problem and provides complete information. Ratio scale data are like interval scale data, except they have a zero point and ratios can be calculated....
21.8K
Introduction to Normal Distributions01:29

Introduction to Normal Distributions

101
Standardized test scores often follow a symmetric distribution that can be modeled with the normal distribution, a fundamental concept in statistics. This distribution is particularly useful for interpreting test performance fairly across populations, as it provides a mathematical framework for understanding variability and central tendency in large datasets.From Histogram to Frequency DistributionRaw test data are often displayed using histograms, where the height of each bar represents the...
101

こちらも読む

関連記事

共著者、ジャーナル、引用グラフによってこの研究に関連する記事。

並び替え
Same author

Differential Item Functioning via Robust Scaling.

Psychometrika·2024
Same author

Measuring early learning and development across cultures: Invariance of the IDELA across five countries.

Developmental psychology·2018
Same author

Psychometric Models of Small Group Collaborations.

Psychometrika·2018
Same journal

BAYESIAN MIXED MULTIDIMENSIONAL SCALING FOR AUDITORY PROCESSING.

Psychometrika·2026
Same journal

Testing linear hypotheses in repeated measures generalized linear models using external information.

Psychometrika·2026
Same journal

When Do Unifactorial Items Increase the Reliability?

Psychometrika·2026
Same journal

Longitudinal Designs for Diagnostic Models: Identification and Estimation.

Psychometrika·2026
Same journal

Modeling Rare Events and Nonmonotone Nonignorable Missingness of Time-Varying Outcomes and Predictors in Binary Time-Series Daily Diary Data: A Bayesian Selection Model.

Psychometrika·2026
Same journal

Revelle's Beta: The Wait Is Over-Computation Becomes Possible.

Psychometrika·2026
関連記事をすべて見る

関連する実験動画

Updated: Feb 26, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

1.3K

ロバストスケーリングによる項目応答理論における項目特性値の差異検出

Peter F Halpin1

  • 1University of North Carolina at Chapel Hill.

Psychometrika
|February 25, 2026
PubMed
まとめ
この要約は機械生成です。

この研究は、項目応答理論(IRT)モデルにおける項目特性値の差異(DIF)を検出するための、アンカー項目を必要としない新しい方法を導入します。このアプローチは、DIFをロバスト統計を使用した外れ値検出として再定式化し、より柔軟で効果的な分析を提供します。

キーワード:
項目特性値の差異項目応答理論ロバスト統計テストスケーリングと等化

さらに関連する動画

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.3K
Qualitative and Quantitative Validation of Tools with Rating Scales Aimed at Assessing the Quality of University Service-Learning
10:39

Qualitative and Quantitative Validation of Tools with Rating Scales Aimed at Assessing the Quality of University Service-Learning

Published on: August 29, 2025

1.3K

関連する実験動画

Last Updated: Feb 26, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

1.3K
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.3K
Qualitative and Quantitative Validation of Tools with Rating Scales Aimed at Assessing the Quality of University Service-Learning
10:39

Qualitative and Quantitative Validation of Tools with Rating Scales Aimed at Assessing the Quality of University Service-Learning

Published on: August 29, 2025

1.3K

科学分野:

  • 心理測定学
  • 教育測定学
  • 統計学

背景:

  • 項目特性値の差異(DIF)はテストの公平性にとって重要です。
  • 現在のDIF検出方法では、多くの場合、事前に指定されたアンカー項目が必要であり、その適用性が制限されています。
  • 項目応答理論(IRT)は、項目と個人の特性を分析するためのフレームワークを提供します。

研究 の 目的:

  • IRTモデルにおけるDIFを評価するための新しい方法を提案すること。
  • アンカー項目を必要としないDIF検出アプローチを開発すること。
  • DIF分析の頑健性と効率性を向上させること。

主な方法:

  • IRTスケーリング内での外れ値検出問題としてのDIFの再定式化。
  • パラメータ推定のためのロバスト統計、特に赤下げM推定値の使用。
  • DIF検出のための漸近第一種エラー率を制御するための推定値の調整。

主要な成果:

  • 提案された赤下げM推定値は、DIFが存在しない場合には効率的であり、DIFが存在する場合には頑健であることを実証しています。
  • シミュレーション研究では、既存のDIF検出方法と比較して良好な結果が得られました。
  • 実際のデータ例では、アンカー項目が実現不可能な場合のこの方法の実用的な応用が示されています。

結論:

  • 提案された方法は、特にアンカー項目が利用できない場合のDIF評価に実行可能な代替手段を提供します。
  • このロバストな統計的アプローチは、IRTにおけるDIF検出の信頼性を向上させます。
  • この調査結果は、教育的および心理的評価の公平性と妥当性の向上に影響を与えます。