Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Outliers and Influential Points

Outliers and Influential Points

An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...

Weighted Mean

Weighted Mean

While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...

What is Central Tendency?

What is Central Tendency?

Descriptive statistics describe or summarize relevant characteristics of a sample and aid in the analysis of data of interest. When analyzing large quantities of data and developing an inference, one needs to identify a value representative of the entire data set. Characteristics such as central tendency, extreme values, range of measurements, or the most repeated value can help better understand the data.
The central tendency is the most conventionally used data characteristic. It is a...

Variability: Analysis

Variability: Analysis

Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...

Statistical Analysis: Overview

Statistical Analysis: Overview

When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...

Significance Testing: Overview

Significance Testing: Overview

Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Leucine Aminopeptidase-Responsive Bioluminescent Probe for Point-of-Care Diagnosis of Pancreatic Cancer in Blood and Urine.

Analytical chemistry·2026

Same author

A generative approach for semantic auditing of electronic health records.

NPJ digital medicine·2026

Same author

FOXM1 expression is induced by the brain microenvironment and supports CRC brain metastatic adaptation.

Clinical & experimental metastasis·2026

Same author

Gut microbiota composition correlates with PBMC microRNA expression following maximal exercise testing in endurance athletes.

Frontiers in microbiomes·2026

Same author

Artificial Intelligence Does Not Always Win.

The Israel Medical Association journal : IMAJ·2026

Same author

Revisiting low penetrance retinoblastoma: an integrated clinical, genetic, and bioinformatic analysis.

Human molecular genetics·2026

Same journal

What do LLMs value? An evaluation framework for revealing subjective trade-offs in assessment of glycemic control.

Proceedings of machine learning research·2026

Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026

Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026

Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026

Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026

Same journal

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Proceedings of machine learning research·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Oct 19, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data.

Amnon Catav¹, Boyang Fu², Yazeed Zoabi³

¹School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel.

Proceedings of Machine Learning Research

|September 27, 2021

Summary

This summary is machine-generated.

New feature importance methods are needed for explaining real-world data, not just models. The study introduces Marginal Contribution Feature Importance (MCI), a novel score that accurately reflects feature contributions, especially with correlated data.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

Related Experiment Videos

Last Updated: Oct 19, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

Area of Science:

Machine Learning
Data Science
Statistical Modeling

Background:

Feature importance scores are crucial for understanding model behavior and real-world phenomena.
Existing methods excel at explaining models but falter when explaining data, particularly with feature correlations.

Purpose of the Study:

To address the limitations of current feature importance scores in data explanation.
To develop a theoretically sound and empirically validated feature importance score for explaining data.
To introduce the Marginal Contribution Feature Importance (MCI) score.

Main Methods:

Defined a set of axioms for desirable properties of data-explaining feature importance scores.
Proved the uniqueness of a score satisfying these axioms.
Developed and analyzed the Marginal Contribution Feature Importance (MCI) score.
Conducted empirical evaluations to demonstrate the score's effectiveness.

Main Results:

Demonstrated the limitations of existing feature importance scores when explaining data with correlated features.
Introduced the Marginal Contribution Feature Importance (MCI) as the unique score satisfying proposed axioms.
Empirically validated the merits of MCI in explaining data.

Conclusions:

Marginal Contribution Feature Importance (MCI) offers a robust solution for explaining data, overcoming limitations of existing model-centric approaches.
MCI provides a reliable method for understanding feature contributions in real-world phenomena where direct experimentation is infeasible.