Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

313
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
313
Significance Testing: Overview01:04

Significance Testing: Overview

7.7K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
7.7K
Applications of Molecular Taxonomy01:20

Applications of Molecular Taxonomy

248
Molecular taxonomy has revolutionized the understanding and classification of bacteria, providing precise insights into their diversity, evolutionary relationships, and ecological roles. By utilizing molecular techniques such as DNA sequencing and fingerprinting, researchers have made significant strides in various fields related to bacterial studies.Resolving Taxonomic AmbiguitiesMolecular taxonomy has been instrumental in distinguishing closely related bacterial species initially thought to...
248
DNA Microarrays02:34

DNA Microarrays

19.2K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
19.2K
Fisher's Exact Test01:08

Fisher's Exact Test

899
Fisher's exact test is a statistical significance test widely used to analyze 2x2 contingency tables, particularly in situations where sample sizes are small. Unlike the chi-squared test, which approximates P-values and assumes minimum expected frequencies of at least five in each cell, Fisher's exact test calculates the exact probability (P-value) of observing the data or more extreme results under the null hypothesis. This feature makes it especially valuable when the assumptions of...
899
Sign Test for Matched Pairs01:17

Sign Test for Matched Pairs

248
The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...
248

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Optimization of stapled peptide inhibitors reveals design principles for targeting talin-induced integrin activation.

bioRxiv : the preprint server for biology·2026
Same author

Direct comparison of the structural dynamics between spontaneous and ligand-induced folding of staphylococcal nuclease.

Protein science : a publication of the Protein Society·2025
Same author

Molecular and translational biology of the blood-based VeriStrat® proteomic test used in cancer immunotherapy treatment guidance.

Journal of mass spectrometry and advances in the clinical lab·2023
Same author

The needs of multiple birth families during the first 1001 critical days: A rapid review with a systematic literature search and narrative synthesis.

Public health nursing (Boston, Mass.)·2023
Same author

Folding of Staphylococcal Nuclease Induced by Binding of Chemically Modified Substrate Analogues Sheds Light on Mechanisms of Coupled Folding/Binding Reactions.

Biochemistry·2023
Same author

PDLIM3 supports hedgehog signaling in medulloblastoma by facilitating cilia formation.

Cell death and differentiation·2023
Same journal

Risk prediction of sepsis-associated acute kidney injury: development, validation of a machine learning model with multicenter data.

BMC medical informatics and decision making·2026
Same journal

Trajectory analysis of sleep disorders and anxiety-depression in female breast cancer patients undergoing chemotherapy: based on group-based Multi-Trajectory Model and machine learning.

BMC medical informatics and decision making·2026
Same journal

Multitask learning of longitudinal circulating biomarkers and clinical outcomes: identification of optimal machine-learning and deep-learning models.

BMC medical informatics and decision making·2026
Same journal

Comparative machine learning approaches to prognosticate clinical outcomes in oral and maxillofacial space infections: a retrospective analysis.

BMC medical informatics and decision making·2026
Same journal

Development and validation of machine learning models for early diagnosis of hemophagocytic lymphohistiocytosis in pediatric Epstein-Barr virus infection.

BMC medical informatics and decision making·2026
Same journal

Clinical subphenotypes in septic patients with new-onset atrial fibrillation: validation and parsimonious classifier model development.

BMC medical informatics and decision making·2026
See all related articles

Related Experiment Video

Updated: Oct 29, 2025

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.1K

Explaining multivariate molecular diagnostic tests via Shapley values.

Joanna Roder1, Laura Maguire2, Robert Georgantas2

  • 1Biodesix, Inc., 2970 Wilderness Place, Ste100, Boulder, CO, 80301, USA. joanna.roder@biodesix.com.

BMC Medical Informatics and Decision Making
|July 9, 2021
PubMed
Summary
This summary is machine-generated.

Shapley values offer interpretable explanations for molecular diagnostic tests, revealing attribute importance for individual patients. However, approximate methods may yield different results compared to exact Shapley values, necessitating caution in their application.

Keywords:
Artificial intelligenceExplainabilityInterpretabilityMachine learningMolecular diagnostic testShapley values

More Related Videos

VDJ-Seq: Deep Sequencing Analysis of Rearranged Immunoglobulin Heavy Chain Gene to Reveal Clonal Evolution Patterns of B Cell Lymphoma
15:07

VDJ-Seq: Deep Sequencing Analysis of Rearranged Immunoglobulin Heavy Chain Gene to Reveal Clonal Evolution Patterns of B Cell Lymphoma

Published on: December 28, 2015

27.0K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.9K

Related Experiment Videos

Last Updated: Oct 29, 2025

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.1K
VDJ-Seq: Deep Sequencing Analysis of Rearranged Immunoglobulin Heavy Chain Gene to Reveal Clonal Evolution Patterns of B Cell Lymphoma
15:07

VDJ-Seq: Deep Sequencing Analysis of Rearranged Immunoglobulin Heavy Chain Gene to Reveal Clonal Evolution Patterns of B Cell Lymphoma

Published on: December 28, 2015

27.0K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.9K

Area of Science:

  • Biostatistics
  • Bioinformatics
  • Machine Learning

Background:

  • Machine learning (ML) is crucial for extracting insights from molecular data to develop diagnostic tests.
  • Algorithmic explainability is essential for understanding how ML models generate classifications.
  • Shapley values provide a method for explaining ML model outputs based on input data.

Purpose of the Study:

  • To calculate and interpret exact Shapley values for a clinical molecular diagnostic test (VeriStrat®).
  • To compare exact Shapley values with approximation methods like LIME and SHAP.
  • To assess the utility of Shapley values for sample-level explanations and patient subgroup identification.

Main Methods:

  • Exact Shapley values were computed for the VeriStrat® test using data from 256 patients.
  • Standard approximation techniques, including LIME and SHAP-based methods, were employed.
  • Results from exact Shapley values were compared against those from approximation methods.

Main Results:

  • Exact Shapley values demonstrated that attribute importance for classification varied by sample.
  • Interpretability at the sample/patient level was achieved using exact Shapley values and interaction metrics.
  • Approximation methods (LIME, SHAP) produced quantitatively and qualitatively different results compared to exact Shapley values.

Conclusions:

  • Shapley values effectively determine attribute importance in molecular diagnostic tests for individual patients.
  • Shapley value profiles can define patient subgroups, potentially guiding translational research.
  • Caution is advised when using approximate Shapley value methods due to potential discrepancies with exact values, especially with correlated molecular data and small training sets.