Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

305
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
305
Variation01:19

Variation

6.8K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
6.8K
Prediction Intervals01:03

Prediction Intervals

2.3K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.3K
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

3.3K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
3.3K
Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

150
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
150

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Micro- and Nanoplastics as Potential Drivers of Dilated Cardiomyopathy.

Life (Basel, Switzerland)·2026
Same author

Equitable lipid optimisation through a data-driven, pharmacist-led secondary prevention pathway.

Open heart·2026
Same author

Aberrant splicing of MBD1 reshapes the epigenome to drive convergent myeloerythroid defects in MDS.

Blood·2026
Same author

Innervated Local Flap Reconstruction for Digital Soft Tissue Defects: A Systematic Review.

ANZ journal of surgery·2026
Same author

Comparative Study of Molecular Descriptors and AI-Based Embeddings for Toxicity Prediction.

Chemical research in toxicology·2025
Same author

Federated learning: a privacy-preserving approach to data-centric regulatory cooperation.

Frontiers in drug safety and regulation·2025
Same journal

Environmental pollutants as vascular poisons: rethinking mesenteric ischemia through the lens of heavy metal toxicity.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Three decades of Nigerian bottled water quality research: a systematic review of contaminants and research gaps for environmental toxicology.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Association between long-term exposure to ambient air pollution and breast cancer risk in Mexico: a systematic review of observational studies.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Genotoxic effects of atmospheric PAH and heavy metals: Interaction and mechanistic evidence - a narrative review.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Interactions between polystyrene-derived micro- and nanoplastics and the microbiota: a systematic review of multi-omics mouse studies.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

A critical review of styrene and styrene-7,8-oxide genotoxicity literature: an update.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
查看所有相关文章

相关实验视频

Updated: Jun 28, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

758

性能:通过预测性和可解释性准备度公式评估模型性能.

Leihong Wu1, Joshua Xu1, Weida Tong1

  • 1Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, Jefferson, AR, USA.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis
|April 15, 2024
PubMed
概括
此摘要是机器生成的。

我们开发了PERForm,这是一个统一的公式,将可解释性整合到用于AI模型评估的定量指标中. 这种方法提供了一个平衡的预测性和可解释性评估,增强AI模型选择和透明度.

关键词:
量化可解释性测量测量量量化可解释性测量量.在XAI,XAI就是XAI.可解释的人工智能预测建模预测建模

更多相关视频

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.4K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.1K

相关实验视频

Last Updated: Jun 28, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

758
Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.4K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.1K

科学领域:

  • 人工智能的人工智能
  • 计算毒理学计算毒理学
  • 化学信息学 化学信息学

背景情况:

  • 传统的人工智能模型评估通常将性能和可解释性分开,导致主观评估.
  • 现有的定量指标主要集中在模型性能上,忽视了可解释性.
  • 需要综合的量化措施来评估预测性和可解释性.

研究的目的:

  • 介绍PERForm,这是一个统一的公式,用于在AI模型中量化测量预测性和可解释性.
  • 为基于综合性能和可解释性评估和选择AI模型提供标准化的方法.
  • 推进人工智能应用中的透明度和可解释性.

主要方法:

  • 开发了PERForm公式,将可解释性作为权重因素纳入现有的统计绩效指标中.
  • 在各种数据集 (DILIst,Tox21,MAQC-II) 和各种建模算法中应用了通用的PERForm公式.
  • 评估了73个不同的终点,以证明公式的适用性和实用性.

主要成果:

  • PERForm成功地将可解释性集成到定量AI模型评估中.
  • 在数据集中展示了多样化的模型性能;AdaBoost在DILIst预测中表现出色,而线性回归对于大多数Tox21终点优越.
  • 提供了模型性能和可解释性之间的权衡的定量证据.

结论:

  • PERForm提供了一个强大的,定量框架来评估AI模型,平衡预测性和可解释性.
  • 这种方法有助于更明智的模型选择,应用和开发.
  • 这项研究有助于提高人工智能的透明度和可解释性.