Scrutinizing XAI using linear ground-truth data with suppressor variables.

Rick Wilming¹, Céline Budding², Klaus-Robert Müller¹,³,⁴,⁵

¹Technische Universität Berlin, Berlin, Germany.

Machine Learning

May 25, 2022

Summary

This summary is machine-generated.

Explainable AI (XAI) methods often report feature importance without any formal way to validate it. This study proposes a definition of feature importance based on statistical association with the prediction target and shows that most current XAI techniques fail to distinguish truly important features from suppressor variables.

Keywords:

Benchmark; Explainable AI; Ground truth; Linear classification; Saliency methods; Suppressor variables

Area of Science:

Artificial Intelligence
Machine Learning
Explainable AI (XAI)

Background:

Complex machine learning models are often "black boxes", necessitating explainable AI (XAI) techniques.
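Saliency methods in XAI rank input features by importance, but they lack formal validation and can highlight "suppressor variables": features that have no statistical association with the prediction target, yet receive non-zero model weight because they help cancel noise in other features.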

Purpose of the Study:

To propose an objective, preliminary definition for feature importance based on statistical association with the prediction target.
To evaluate the performance of common XAI saliency methods against this new definition, particularly concerning suppressor variables.
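One way to write this definition down is sketched below. The notation is an illustrative assumption, not necessarily the paper's own: $X_1, \dots, X_d$ are the input features, $Y$ is the prediction target, $w_j$ is the weight a linear model assigns to feature $j$, and $\perp\!\!\!\perp$ denotes statistical independence.

```latex
% Feature importance as statistical association with the target (assumed notation)
X_j \text{ is important} \;\iff\; X_j \not\perp\!\!\!\perp Y

% A suppressor variable, by contrast, is unrelated to the target
% but still used by the model:
X_j \text{ is a suppressor} \;\iff\; X_j \perp\!\!\!\perp Y \ \text{ and } \ w_j \neq 0
```

Under this reading, a saliency method behaves correctly if it assigns importance only to features of the first kind.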

Main Methods:

Developed a ground-truth dataset with well-defined, linear statistical dependencies.
Evaluated multiple XAI methods (LRP, DTD, PatternNet, PatternAttribution, LIME, Anchors, SHAP, permutation-based) on the benchmark dataset.
Assessed the ability of these methods to differentiate statistically important features from suppressor variables.
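The kind of linear ground-truth data described above can be illustrated with a minimal two-feature sketch. This is a textbook-style toy construction, not the paper's benchmark: the function name `make_suppressor_data`, the sample size, and the use of regression instead of classification are all assumptions made for brevity. Feature `x1` carries the signal mixed with a distractor, while feature `x2` carries only the distractor.

```python
import numpy as np

def make_suppressor_data(n=5000, seed=0):
    """Toy data (hypothetical): x1 = signal + distractor, x2 = distractor only."""
    rng = np.random.default_rng(seed)
    z = rng.standard_normal(n)   # signal that defines the target
    d = rng.standard_normal(n)   # distractor, independent of the target
    X = np.column_stack([z + d, d])
    y = z
    return X, y

X, y = make_suppressor_data()

# Least-squares linear fit. Since y = x1 - x2 holds exactly,
# the solution is w = (1, -1) up to floating-point error.
w, *_ = np.linalg.lstsq(X, y, rcond=None)

# Pearson correlation of each feature with the target.
corr = np.array([np.corrcoef(X[:, j], y)[0, 1] for j in range(X.shape[1])])

print("weights:     ", np.round(w, 3))
print("correlations:", np.round(corr, 3))
```

The second feature gets a weight of magnitude one even though its correlation with the target is close to zero. That makes it a suppressor variable, and any attribution that follows the model weights will mark it as relevant.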

Main Results:

Most evaluated XAI saliency methods failed to distinguish between statistically important features and suppressor variables.
The proposed objective definition highlighted limitations in current feature importance assessment within XAI.
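To see why a method can fail in this way, consider a hedged sketch of permutation-based importance on a toy suppressor setup. It does not reproduce the paper's experiments: the data, the fixed noise-cancelling weights `w = (1, -1)`, and the helper names `mse` and `permutation_importance` are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5000
z = rng.standard_normal(n)            # signal that defines the target
d = rng.standard_normal(n)            # distractor, independent of the target
X = np.column_stack([z + d, d])       # x1 = signal + distractor, x2 = distractor
y = z

w = np.array([1.0, -1.0])             # linear model that cancels the distractor

def mse(X, y, w):
    """Mean squared error of the linear prediction X @ w."""
    return float(np.mean((X @ w - y) ** 2))

def permutation_importance(X, y, w, rng):
    """Increase in MSE when one feature column is shuffled at a time."""
    base = mse(X, y, w)
    scores = []
    for j in range(X.shape[1]):
        Xp = X.copy()
        Xp[:, j] = rng.permutation(Xp[:, j])
        scores.append(mse(Xp, y, w) - base)
    return np.array(scores)

scores = permutation_importance(X, y, w, rng)
corr_x2 = np.corrcoef(X[:, 1], y)[0, 1]

print("permutation importance:", np.round(scores, 2))
print("corr(x2, y):           ", round(float(corr_x2), 3))
```

Shuffling `x2` degrades the predictions, so it receives a clearly positive importance score even though it is essentially uncorrelated with the target. Under a definition based on statistical association, that score is a false positive.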

Conclusions:

A formal definition of feature importance is crucial for reliable XAI.
Current popular XAI saliency methods require refinement to accurately identify true feature importance and avoid misinterpretations from suppressor variables.