Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Censoring Survival Data01:09

Censoring Survival Data

258
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
258
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.2K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.2K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

166
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
166
Statically Indeterminate Problem Solving01:16

Statically Indeterminate Problem Solving

508
Statically indeterminate problems are those where statics alone can not determine the internal forces or reactions. Consider a structure comprising two cylindrical rods made of steel and brass. These rods are joined at point B and restrained by rigid supports at points A and C. Now, the reactions at points A and C and the deflection at point B are to be determined. This rod structure is classified as statically indeterminate as the structure has more supports than are necessary for maintaining...
508
Machines: Problem Solving II01:30

Machines: Problem Solving II

380
Machines are complex structures consisting of movable, pin-connected multi-force members that work together to transmit forces. Consider a lifting tong carrying a 100 kg load. It comprises movable sections DAF and CBG linked together with member AB.
380
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.4K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Towards robust foundation models for digital pathology.

Nature communications·2026
Same author

Modeling attention and binding in the brain through bidirectional recurrent gating.

Nature communications·2026
Same author

How simple can you go? An off-the-shelf transformer approach to molecular dynamics.

The Journal of chemical physics·2026
Same author

Abnormal hippocampo-cortical theta-gamma phase-amplitude coupling in Alzheimer's disease.

medRxiv : the preprint server for health sciences·2026
Same author

Software for dataset-wide XAI: From local explanations to global insights with Zennit, CoRelAy, and ViRelAy.

PloS one·2026
Same author

Peering inside the black box by learning the relevance of many-body functions in neural network potentials.

Nature communications·2025
Same journal

Your Next State-of-the-Art Could Come from Another Domain: A Cross-Domain Analysis of Hierarchical Text Classification.

Machine learning·2026
Same journal

Linear Causal Discovery with Interventional Constraints.

Machine learning·2026
Same journal

Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models.

Machine learning·2025
Same journal

Mining exceptional social behavior on attributed interaction networks.

Machine learning·2025
Same journal

Persistent Laplacian-enhanced algorithm for scarcely labeled data classification.

Machine learning·2025
Same journal

Ensuring medical AI safety: interpretability-driven detection and mitigation of spurious model behavior and associated data.

Machine learning·2025
See all related articles

Related Experiment Video

Updated: Sep 22, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

604

Scrutinizing XAI using linear ground-truth data with suppressor variables.

Rick Wilming1, Céline Budding2, Klaus-Robert Müller1,3,4,5

  • 1Technische Universität, Berlin, Germany.

Machine Learning
|May 25, 2022
PubMed
Summary
This summary is machine-generated.

Explainable AI (XAI) methods often struggle to validate feature importance. This study proposes a new definition for feature importance, showing most current XAI techniques fail to distinguish true importance from suppressor variables.

Keywords:
BenchmarkExplainable AIGround truthLinear classificationSaliency methodsSuppressor variables

More Related Videos

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.2K
A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents
06:25

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

Published on: May 16, 2025

636

Related Experiment Videos

Last Updated: Sep 22, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

604
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.2K
A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents
06:25

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

Published on: May 16, 2025

636

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Explainable AI (XAI)

Background:

  • Complex machine learning models are often "black boxes", necessitating explainable AI (XAI) techniques.
  • Saliency methods in XAI rank input features by importance, but lack formal validation and can highlight irrelevant "suppressor variables".

Purpose of the Study:

  • To propose an objective, preliminary definition for feature importance based on statistical association with the prediction target.
  • To evaluate the performance of common XAI saliency methods against this new definition, particularly concerning suppressor variables.

Main Methods:

  • Developed a ground-truth dataset with well-defined, linear statistical dependencies.
  • Evaluated multiple XAI methods (LRP, DTD, PatternNet, PatternAttribution, LIME, Anchors, SHAP, permutation-based) on the benchmark dataset.
  • Assessed the ability of these methods to differentiate statistically important features from suppressor variables.

Main Results:

  • Most evaluated XAI saliency methods failed to distinguish between statistically important features and suppressor variables.
  • The proposed objective definition highlighted limitations in current feature importance assessment within XAI.

Conclusions:

  • A formal definition of feature importance is crucial for reliable XAI.
  • Current popular XAI saliency methods require refinement to accurately identify true feature importance and avoid misinterpretations from suppressor variables.