Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Sensitivity, Specificity, and Predicted Value

Sensitivity, Specificity, and Predicted Value

In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...

Variation

Variation

An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Goodness-of-Fit Test

Goodness-of-Fit Test

The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...

Receiver Operating Characteristic Plot

Receiver Operating Characteristic Plot

A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Micro- and Nanoplastics as Potential Drivers of Dilated Cardiomyopathy.

Life (Basel, Switzerland)·2026

Same author

Equitable lipid optimisation through a data-driven, pharmacist-led secondary prevention pathway.

Open heart·2026

Same author

Aberrant splicing of MBD1 reshapes the epigenome to drive convergent myeloerythroid defects in MDS.

Blood·2026

Same author

Innervated Local Flap Reconstruction for Digital Soft Tissue Defects: A Systematic Review.

ANZ journal of surgery·2026

Same author

Comparative Study of Molecular Descriptors and AI-Based Embeddings for Toxicity Prediction.

Chemical research in toxicology·2025

Same author

Federated learning: a privacy-preserving approach to data-centric regulatory cooperation.

Frontiers in drug safety and regulation·2025

Same journal

Environmental pollutants as vascular poisons: rethinking mesenteric ischemia through the lens of heavy metal toxicity.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026

Same journal

Three decades of Nigerian bottled water quality research: a systematic review of contaminants and research gaps for environmental toxicology.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026

Same journal

Association between long-term exposure to ambient air pollution and breast cancer risk in Mexico: a systematic review of observational studies.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026

Same journal

Genotoxic effects of atmospheric PAH and heavy metals: Interaction and mechanistic evidence - a narrative review.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026

Same journal

Interactions between polystyrene-derived micro- and nanoplastics and the microbiota: a systematic review of multi-omics mouse studies.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026

Same journal

A critical review of styrene and styrene-7,8-oxide genotoxicity literature: an update.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 28, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

PERform: assessing model performance with predictivity and explainability readiness formula.

Leihong Wu¹, Joshua Xu¹, Weida Tong¹

¹Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, Jefferson, AR, USA.

Journal of Environmental Science and Health. Part C, Toxicology and Carcinogenesis

|April 15, 2024

Summary

This summary is machine-generated.

We developed PERForm, a unified formula integrating explainability into quantitative metrics for AI model evaluation. This approach provides a balanced assessment of predictivity and explainability, enhancing AI model selection and transparency.

Keywords:

Quantitative explainability measurement XAI explainable artificial intelligence predictive modeling

More Related Videos

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

Related Experiment Videos

Last Updated: Jun 28, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

Area of Science:

Artificial Intelligence
Computational Toxicology
Cheminformatics

Background:

Traditional AI model evaluation often separates performance and explainability, leading to subjective assessments.
Existing quantitative metrics primarily focus on model performance, neglecting interpretability.
There is a need for integrated, quantitative measures to assess both predictivity and explainability.

Purpose of the Study:

To introduce PERForm, a unified formula for quantitatively measuring both predictivity and explainability in AI models.
To provide a standardized method for evaluating and selecting AI models based on integrated performance and interpretability.
To advance transparency and interpretability in artificial intelligence applications.

Main Methods:

Developed the PERForm formula, incorporating explainability as a weighting factor into existing statistical performance metrics.
Applied the generic PERForm formula across diverse datasets (DILIst, Tox21, MAQC-II) and various modeling algorithms.
Evaluated 73 distinct endpoints to demonstrate the formula's applicability and utility.

Main Results:

PERForm successfully integrated explainability into quantitative AI model assessment.
Demonstrated varied model performances across datasets; AdaBoost excelled in DILIst prediction, while linear regression was superior for most Tox21 endpoints.
Provided quantitative evidence of the trade-offs between model performance and explainability.

Conclusions:

PERForm offers a robust, quantitative framework for evaluating AI models, balancing predictivity and explainability.
This approach facilitates more informed model selection, application, and development.
The research significantly contributes to enhancing transparency and interpretability in artificial intelligence.