Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Randomized Experiments

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...

Statistical Hypothesis Testing

Statistical Hypothesis Testing

Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...

Types of Hypothesis Testing

Types of Hypothesis Testing

There are three types of hypothesis tests: right-tailed, left-tailed, and two-tailed.
When the null and alternative hypotheses are stated, it is observed that the null hypothesis is a neutral statement against which the alternative hypothesis is tested. The alternative hypothesis is a claim that instead has a certain direction. If the null hypothesis claims that p = 0.5, the alternative hypothesis would be an opposing statement to this and can be put either p > 0.5, p < 0.5, or p...

Significance Testing: Overview

Significance Testing: Overview

Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...

Multiple Comparison Tests

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...

Variability: Analysis

Variability: Analysis

Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Pediatric oncology patients prefer cocoa over oral nutritional supplements in a double-blinded feeding-trial.

Frontiers in nutrition·2026

Same author

Educational inequalities are associated with distinct metabolomic and gut microbiome patterns in adults.

Social science & medicine (1982)·2026

Same author

Impact of single freeze-thaw cycles on human serum proteins: Implications for mass spectrometry biomarker validation.

iScience·2026

Same author

Effect of ice-lollies on the recovery time after anaesthesia: protocol for a cluster-randomised trial (Icesthesia).

Perioperative medicine (London, England)·2026

Same author

Identifying common risk factors for primary cellulitis in a large-scale retrospective cohort study.

International journal of infectious diseases : IJID : official publication of the International Society for Infectious Diseases·2026

Same author

Real-world analysis of immune checkpoint inhibitor efficacy and response predictors in patients treated at the CCCMunich<sup>LMU</sup> outpatient clinic.

Scientific reports·2025

Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026

Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026

Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026

Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026

Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026

Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 6, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

ShadowVIMP: permutation-based multiple testing-controlled variable selection.

Tim Müller¹, Roman Hornung^2,3, Silke Szymczak⁴

¹Staburo GmbH, Aschauer Straße 26a, 81549, Munich, Bavaria, Germany. mueller@staburo.de.

BMC Bioinformatics

|May 4, 2026

Summary

This summary is machine-generated.

shadowVIMP is a novel method for multiple testing-controlled variable selection in high-dimensional data. It improves sensitivity and robustness, addressing biases in random forest variable importance scores.

Keywords:

High-dimensional Multiple testing correction Random forest Variable importance Variable selection

More Related Videos

Barnes Maze Testing Strategies with Small and Large Rodent Models

Barnes Maze Testing Strategies with Small and Large Rodent Models

Published on: February 26, 2014

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Related Experiment Videos

Last Updated: May 6, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Barnes Maze Testing Strategies with Small and Large Rodent Models

Barnes Maze Testing Strategies with Small and Large Rodent Models

Published on: February 26, 2014

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Area of Science:

Bioinformatics
Computational Biology
Statistical Genetics

Background:

Identifying biomarkers is crucial for precision medicine, especially with high-dimensional data.
Random Forests (RFs) are effective for high-dimensional data but face challenges in variable selection due to complex VIMP score distributions.
Standard statistical testing and multiple testing adjustments for RF variable importance (VIMP) are difficult.

Purpose of the Study:

To introduce shadowVIMP, a novel method for multiple testing-controlled variable selection in RF analysis.
To address limitations of existing RF variable selection methods, particularly concerning correlated and categorical variables.
To provide a robust approach for biomarker discovery in high-dimensional datasets.

Main Methods:

Propose shadowVIMP, a method inspired by permutation testing for variable selection.
shadowVIMP generates permuted variable counterparts to calculate adjusted p-values for VIMP scores.
The method preserves the correlation structure between variables, mitigating selection bias.

Main Results:

shadowVIMP demonstrates improved sensitivity and provides multiple testing-adjusted results in high-dimensional settings.
The method shows robustness against VIMP biases caused by correlated and categorical variables.
shadowVIMP can visually annotate VIMP plots for selected variable sets.

Conclusions:

shadowVIMP offers a promising approach for reliable variable selection in RF analysis.
It effectively addresses known biases in permutation-based VIMP measures.
The shadowVIMP R package is available on CRAN for practical application.