Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

305
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
305
Variation01:19

Variation

6.8K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
6.8K
Prediction Intervals01:03

Prediction Intervals

2.3K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.3K
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

3.3K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
3.3K
Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

150
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
150

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Micro- and Nanoplastics as Potential Drivers of Dilated Cardiomyopathy.

Life (Basel, Switzerland)·2026
Same author

Equitable lipid optimisation through a data-driven, pharmacist-led secondary prevention pathway.

Open heart·2026
Same author

Aberrant splicing of MBD1 reshapes the epigenome to drive convergent myeloerythroid defects in MDS.

Blood·2026
Same author

Innervated Local Flap Reconstruction for Digital Soft Tissue Defects: A Systematic Review.

ANZ journal of surgery·2026
Same author

Comparative Study of Molecular Descriptors and AI-Based Embeddings for Toxicity Prediction.

Chemical research in toxicology·2025
Same author

Federated learning: a privacy-preserving approach to data-centric regulatory cooperation.

Frontiers in drug safety and regulation·2025
Same journal

Environmental pollutants as vascular poisons: rethinking mesenteric ischemia through the lens of heavy metal toxicity.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Three decades of Nigerian bottled water quality research: a systematic review of contaminants and research gaps for environmental toxicology.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Association between long-term exposure to ambient air pollution and breast cancer risk in Mexico: a systematic review of observational studies.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Genotoxic effects of atmospheric PAH and heavy metals: Interaction and mechanistic evidence - a narrative review.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

Interactions between polystyrene-derived micro- and nanoplastics and the microbiota: a systematic review of multi-omics mouse studies.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
Same journal

A critical review of styrene and styrene-7,8-oxide genotoxicity literature: an update.

Journal of environmental science and health. Part C, Toxicology and carcinogenesis·2026
See all related articles

Related Experiment Video

Updated: Jun 28, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

758

PERform: assessing model performance with predictivity and explainability readiness formula.

Leihong Wu1, Joshua Xu1, Weida Tong1

  • 1Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, Jefferson, AR, USA.

Journal of Environmental Science and Health. Part C, Toxicology and Carcinogenesis
|April 15, 2024
PubMed
Summary
This summary is machine-generated.

We developed PERForm, a unified formula integrating explainability into quantitative metrics for AI model evaluation. This approach provides a balanced assessment of predictivity and explainability, enhancing AI model selection and transparency.

Keywords:
Quantitative explainability measurementXAIexplainable artificial intelligencepredictive modeling

More Related Videos

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.4K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.1K

Related Experiment Videos

Last Updated: Jun 28, 2025

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

758
Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.4K
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.1K

Area of Science:

  • Artificial Intelligence
  • Computational Toxicology
  • Cheminformatics

Background:

  • Traditional AI model evaluation often separates performance and explainability, leading to subjective assessments.
  • Existing quantitative metrics primarily focus on model performance, neglecting interpretability.
  • There is a need for integrated, quantitative measures to assess both predictivity and explainability.

Purpose of the Study:

  • To introduce PERForm, a unified formula for quantitatively measuring both predictivity and explainability in AI models.
  • To provide a standardized method for evaluating and selecting AI models based on integrated performance and interpretability.
  • To advance transparency and interpretability in artificial intelligence applications.

Main Methods:

  • Developed the PERForm formula, incorporating explainability as a weighting factor into existing statistical performance metrics.
  • Applied the generic PERForm formula across diverse datasets (DILIst, Tox21, MAQC-II) and various modeling algorithms.
  • Evaluated 73 distinct endpoints to demonstrate the formula's applicability and utility.

Main Results:

  • PERForm successfully integrated explainability into quantitative AI model assessment.
  • Demonstrated varied model performances across datasets; AdaBoost excelled in DILIst prediction, while linear regression was superior for most Tox21 endpoints.
  • Provided quantitative evidence of the trade-offs between model performance and explainability.

Conclusions:

  • PERForm offers a robust, quantitative framework for evaluating AI models, balancing predictivity and explainability.
  • This approach facilitates more informed model selection, application, and development.
  • The research significantly contributes to enhancing transparency and interpretability in artificial intelligence.