Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Randomized Experiments01:13

Randomized Experiments

6.3K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
6.3K
Statistical Hypothesis Testing01:16

Statistical Hypothesis Testing

6.1K
Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...
6.1K
Types of Hypothesis Testing01:11

Types of Hypothesis Testing

28.1K
There are three types of hypothesis tests: right-tailed, left-tailed, and two-tailed.
When the null and alternative hypotheses are stated, it is observed that the null hypothesis is a neutral statement against which the alternative hypothesis is tested. The alternative hypothesis is a claim that instead has a certain direction. If the null hypothesis claims that p = 0.5, the alternative hypothesis would be an opposing statement to this and can be put either p > 0.5, p < 0.5, or p...
28.1K
Significance Testing: Overview01:04

Significance Testing: Overview

10.2K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
10.2K
Multiple Comparison Tests01:13

Multiple Comparison Tests

3.5K
Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
3.5K
Variability: Analysis01:11

Variability: Analysis

1.1K
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Pediatric oncology patients prefer cocoa over oral nutritional supplements in a double-blinded feeding-trial.

Frontiers in nutrition·2026
Same author

Educational inequalities are associated with distinct metabolomic and gut microbiome patterns in adults.

Social science & medicine (1982)·2026
Same author

Impact of single freeze-thaw cycles on human serum proteins: Implications for mass spectrometry biomarker validation.

iScience·2026
Same author

Effect of ice-lollies on the recovery time after anaesthesia: protocol for a cluster-randomised trial (Icesthesia).

Perioperative medicine (London, England)·2026
Same author

Identifying common risk factors for primary cellulitis in a large-scale retrospective cohort study.

International journal of infectious diseases : IJID : official publication of the International Society for Infectious Diseases·2026
Same author

Real-world analysis of immune checkpoint inhibitor efficacy and response predictors in patients treated at the CCCMunich<sup>LMU</sup> outpatient clinic.

Scientific reports·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 6, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

ShadowVIMP: permutation-based multiple testing-controlled variable selection.

Tim Müller1, Roman Hornung2,3, Silke Szymczak4

  • 1Staburo GmbH, Aschauer Straße 26a, 81549, Munich, Bavaria, Germany. mueller@staburo.de.

BMC Bioinformatics
|May 4, 2026
PubMed
Summary
This summary is machine-generated.

shadowVIMP is a novel method for multiple testing-controlled variable selection in high-dimensional data. It improves sensitivity and robustness, addressing biases in random forest variable importance scores.

Keywords:
High-dimensionalMultiple testing correctionRandom forestVariable importanceVariable selection

More Related Videos

Barnes Maze Testing Strategies with Small and Large Rodent Models
12:59

Barnes Maze Testing Strategies with Small and Large Rodent Models

Published on: February 26, 2014

42.8K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

16.6K

Related Experiment Videos

Last Updated: May 6, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K
Barnes Maze Testing Strategies with Small and Large Rodent Models
12:59

Barnes Maze Testing Strategies with Small and Large Rodent Models

Published on: February 26, 2014

42.8K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

16.6K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Statistical Genetics

Background:

  • Identifying biomarkers is crucial for precision medicine, especially with high-dimensional data.
  • Random Forests (RFs) are effective for high-dimensional data but face challenges in variable selection due to complex VIMP score distributions.
  • Standard statistical testing and multiple testing adjustments for RF variable importance (VIMP) are difficult.

Purpose of the Study:

  • To introduce shadowVIMP, a novel method for multiple testing-controlled variable selection in RF analysis.
  • To address limitations of existing RF variable selection methods, particularly concerning correlated and categorical variables.
  • To provide a robust approach for biomarker discovery in high-dimensional datasets.

Main Methods:

  • Propose shadowVIMP, a method inspired by permutation testing for variable selection.
  • shadowVIMP generates permuted variable counterparts to calculate adjusted p-values for VIMP scores.
  • The method preserves the correlation structure between variables, mitigating selection bias.

Main Results:

  • shadowVIMP demonstrates improved sensitivity and provides multiple testing-adjusted results in high-dimensional settings.
  • The method shows robustness against VIMP biases caused by correlated and categorical variables.
  • shadowVIMP can visually annotate VIMP plots for selected variable sets.

Conclusions:

  • shadowVIMP offers a promising approach for reliable variable selection in RF analysis.
  • It effectively addresses known biases in permutation-based VIMP measures.
  • The shadowVIMP R package is available on CRAN for practical application.