Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Random Variables

Random Variables

A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...

Classification of Signals

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...

Random Sampling Method

Random Sampling Method

Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Benchmarking community drug response prediction models: datasets, models, tools, and metrics for cross-dataset generalization analysis.

Briefings in bioinformatics·2026

Same author

AdapTor: Adaptive Topological Regression for quantitative structure-activity relationship modeling.

Journal of cheminformatics·2025

Same author

Predictive Modeling of Anticancer Drug Sensitivity Using REFINED CNN.

Methods in molecular biology (Clifton, N.J.)·2025

Same author

Low-dose dietary vorinostat increases brain histone acetylation levels and reduces oxidative stress in an Alzheimer's disease mouse model.

Journal of Alzheimer's disease : JAD·2025

Same author

Hyperspectral imaging to characterize the vegetative tissue biochemical changes in response to water deficit conditions in sorghum (<i>Sorghum bicolor</i>).

Frontiers in plant science·2025

Same author

Cross study transcriptomic investigation of Alzheimer's brain tissue discoveries and limitations.

Scientific reports·2025

Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026

Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026

Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026

Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026

Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026

Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 16, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Sequential feature selection and inference using multi-variate random forests.

Joshua Mayer¹, Raziur Rahman², Souparno Ghosh¹

¹Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX 79409, USA.

Bioinformatics (Oxford, England)

|December 22, 2017

Summary

This summary is machine-generated.

This study introduces a novel Sequential Multi-Response Feature Selection (SMuRF) method for identifying statistically significant features in Random Forests (RF). SMuRF uses conditional inference for coherent variable selection and prediction, enhancing RF

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Basics of Multivariate Analysis in Neuroimaging Data

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

Related Experiment Videos

Last Updated: Feb 16, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Basics of Multivariate Analysis in Neuroimaging Data

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

Area of Science:

Computational Biology
Bioinformatics
Machine Learning

Background:

Random Forest (RF) is a popular prediction tool, but lacks robust methods for identifying statistically significant features, especially in multivariate settings.
Existing feature importance rankings offer relative measures but lack a general inferential mechanism for statistical significance.

Purpose of the Study:

To develop an inferentially justifiable and model-free variable selection procedure for Random Forests.
To create a coherent framework for both variable selection and prediction using conditional inference.
To identify statistically significant genetic features impacting drug sensitivities.

Main Methods:

Utilized the conditional inference tree framework to build a Random Forest by sequentially deleting features based on hypothesis testing.
Developed a sequential algorithm for inferentially sound, model-free variable selection.
Applied the Sequential Multi-Response Feature Selection (SMuRF) approach to the Genomics of Drug Sensitivity for Cancer dataset.

Main Results:

The proposed method provides an inferentially justifiable variable selection procedure.
The Sequential Multi-Response Feature Selection (SMuRF) approach successfully identified significant genetic predictors of drug sensitivity.
Biological validation confirmed the significance of the identified genetic characteristics.

Conclusions:

The developed methodology offers a coherent approach to both variable selection and prediction within the conditional inference framework.
SMuRF enhances Random Forest analysis by enabling statistically rigorous feature identification.
The application on the Genomics of Drug Sensitivity for Cancer dataset highlights the method's utility in biological discovery.