Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Receiver Operating Characteristic Plot

Receiver Operating Characteristic Plot

A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...

Response Surface Methodology

Response Surface Methodology

Response Surface Methodology (RSM) is a collection of statistical and mathematical techniques used to develop, improve, and optimize processes. It is particularly valuable when many input variables or factors potentially influence a response variable.
The process of RSM involves several key steps:

Decision Making: Traditional Method

Decision Making: Traditional Method

The process of hypothesis testing based on the traditional method includes calculating the critical value, testing the value of the test statistic using the sample data, and interpreting these values.
First, a specific claim about the population parameter is decided based on the research question and is stated in a simple form. Further, an opposing statement to this claim is also stated. These statements can act as null and alternative hypotheses, out of which a null hypothesis would be a...

Detection of Gross Error: The Q Test

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...

Testing a Claim about Standard Deviation

Testing a Claim about Standard Deviation

A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...

Critical Region, Critical Values and Significance Level

Critical Region, Critical Values and Significance Level

The critical region, critical value, and significance level are interdependent concepts crucial in hypothesis testing.
In hypothesis testing, a sample statistic is converted to a test statistic using z, t, or chi-square distribution. A critical region is an area under the curve in probability distributions demarcated by the critical value. When the test statistic falls in this region, it suggests that the null hypothesis must be rejected. As this region contains all those values of the...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Few and Different: Detecting Examinees With Preknowledge Using Extended Isolation Forests.

Applied psychological measurement·2025

Same author

Profiles of Adoptee Adjustment in Young Adulthood.

Adoption quarterly·2023

Same author

Pilot Study of a Patient-Centered Radiology Process Model.

Journal of the American College of Radiology : JACR·2016

Same author

Clinical outcomes following hospital-wide implementation of prolonged-infusion cefepime and ceftazidime.

International journal of antimicrobial agents·2015

Same author

Using multivariate generalizability theory to assess the effect of content stratification on the reliability of a performance assessment.

Advances in health sciences education : theory and practice·2010

Same author

Assessing the impact of modifications to the documentation component's scoring rubric and rater training on USMLE integrated clinical encounter scores.

Academic medicine : journal of the Association of American Medical Colleges·2009

Same journal

A Simple Approach for Differential Test Functioning Based on Sum Scores.

Educational and psychological measurement·2026

Same journal

Evaluating Factor Retention in Large Factor Analysis Models: A Simulation Study Comparing 15 Methods.

Educational and psychological measurement·2026

Same journal

Agreement and Alignment in Binary Rating Tasks: Strategic Convergence as an Equilibrium Outcome.

Educational and psychological measurement·2026

Same journal

Interactions Between Termination Criteria and Ability Estimators in Computerized Adaptive Testing.

Educational and psychological measurement·2026

Same journal

Identification and Diagnosis of Misreporting in Surveys.

Educational and psychological measurement·2026

Same journal

The Aggregated Latent Profile Index: Measuring Person Profile Differentiation Within a Bootstrap-Validated Latent Profile Space.

Educational and psychological measurement·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jun 7, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

使用ROC分析来根据标准设定流程完善切割分数.

Dongwei Wang¹, Lisa A Keller¹

¹University of Massachusetts Amherst, USA.

Educational and psychological measurement

|November 18, 2024

概括

此摘要是机器生成的。

优化教育评估削减分数包括考虑样本分布,流行率和成本比率. 根据这些因素调整切割分数可以提高分类准确性,特别是在低流行率的场景中.

关键词:

在ROC分析中,ROC分析切割得分切割得分切割得分精制精制精制精制设定标准的标准设置标准.

更多相关视频

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Published on: August 30, 2013

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

相关实验视频

Last Updated: Jun 7, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Published on: August 30, 2013

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

科学领域:

教育测量教育的测量
心理测量心理测量心理测量
统计分析统计分析

背景情况:

标准设置定义了教育评估中的切割分数,使用学科专家.
改进切割分数需要统计和理论证据来提高分类准确性.

研究的目的:

研究样本分布,流行率和成本比对分类准确性的影响.
提供统计证据来完善教育评估中的切割分数.
检查接收器操作特征 (ROC) 分析如何为切割得分调整提供信息.

主要方法:

模拟了四个样本分布的40个项目响应.
操纵了积极事件的流行率和成本比率 (虚假负面与虚假阳性).
使用接收器操作特征 (ROC) 分析和尤登指数 (J) 来确定最佳切割分数.

主要成果:

最佳的切割分数转向了能力分布的模式.
削减得分调整受流行率和成本比率的影响.
增加切割得分可以改善低流行事件的分类;降低高流行事件的分类.
较高的成本比率导致较低的最佳切割得分.

结论:

切割分数的精细化对于准确的教育评估至关重要.
统计证据支持根据流行率和成本比率调整切割得分.
调查结果为政策决策提供了指导,以优化切割得分.