Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

4.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.0K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

431
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
431
Survival Tree01:19

Survival Tree

439
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
439
Random Variables01:09

Random Variables

17.9K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
17.9K
Classification of Signals01:30

Classification of Signals

1.5K
In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...
1.5K
Random Sampling Method01:09

Random Sampling Method

15.1K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
15.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Benchmarking community drug response prediction models: datasets, models, tools, and metrics for cross-dataset generalization analysis.

Briefings in bioinformatics·2026
Same author

AdapTor: Adaptive Topological Regression for quantitative structure-activity relationship modeling.

Journal of cheminformatics·2025
Same author

Predictive Modeling of Anticancer Drug Sensitivity Using REFINED CNN.

Methods in molecular biology (Clifton, N.J.)·2025
Same author

Low-dose dietary vorinostat increases brain histone acetylation levels and reduces oxidative stress in an Alzheimer's disease mouse model.

Journal of Alzheimer's disease : JAD·2025
Same author

Hyperspectral imaging to characterize the vegetative tissue biochemical changes in response to water deficit conditions in sorghum (<i>Sorghum bicolor</i>).

Frontiers in plant science·2025
Same author

Cross study transcriptomic investigation of Alzheimer's brain tissue discoveries and limitations.

Scientific reports·2025
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Feb 16, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Sequential feature selection and inference using multi-variate random forests.

Joshua Mayer1, Raziur Rahman2, Souparno Ghosh1

  • 1Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX 79409, USA.

Bioinformatics (Oxford, England)
|December 22, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Sequential Multi-Response Feature Selection (SMuRF) method for identifying statistically significant features in Random Forests (RF). SMuRF uses conditional inference for coherent variable selection and prediction, enhancing RF

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.4K

Related Experiment Videos

Last Updated: Feb 16, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.4K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Machine Learning

Background:

  • Random Forest (RF) is a popular prediction tool, but lacks robust methods for identifying statistically significant features, especially in multivariate settings.
  • Existing feature importance rankings offer relative measures but lack a general inferential mechanism for statistical significance.

Purpose of the Study:

  • To develop an inferentially justifiable and model-free variable selection procedure for Random Forests.
  • To create a coherent framework for both variable selection and prediction using conditional inference.
  • To identify statistically significant genetic features impacting drug sensitivities.

Main Methods:

  • Utilized the conditional inference tree framework to build a Random Forest by sequentially deleting features based on hypothesis testing.
  • Developed a sequential algorithm for inferentially sound, model-free variable selection.
  • Applied the Sequential Multi-Response Feature Selection (SMuRF) approach to the Genomics of Drug Sensitivity for Cancer dataset.

Main Results:

  • The proposed method provides an inferentially justifiable variable selection procedure.
  • The Sequential Multi-Response Feature Selection (SMuRF) approach successfully identified significant genetic predictors of drug sensitivity.
  • Biological validation confirmed the significance of the identified genetic characteristics.

Conclusions:

  • The developed methodology offers a coherent approach to both variable selection and prediction within the conditional inference framework.
  • SMuRF enhances Random Forest analysis by enabling statistically rigorous feature identification.
  • The application on the Genomics of Drug Sensitivity for Cancer dataset highlights the method's utility in biological discovery.