Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

323
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
323
DNA Microarrays02:34

DNA Microarrays

20.3K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
20.3K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Low-temperature sintering of silver nanoparticles on paper by surface modification.

Nanotechnology·2019
Same author

Two-Staged Versus Three-Staged Laparoscopic Anorectoplasty for Patients with Rectoprostatic and Bladder Neck Fistulas: A Comparative Study.

Journal of laparoendoscopic & advanced surgical techniques. Part A·2019
Same author

Genome Sequence of a Novel HIV-1 Circulating Recombinant Form (CRF103_01B) Identified from Hebei Province, China.

AIDS research and human retroviruses·2019
Same author

Identification of a Novel HIV-1 Second-Generation (CRF01_AE/B) Among Men Who Have Sex with Men in Tianjin, China.

AIDS research and human retroviruses·2019
Same author

Characterization of a Novel HIV-1 Recombinant Form (CRF01_AE/CRF07_BC/CRF08_BC) Identified from Guangxi, China.

AIDS research and human retroviruses·2019
Same author

Event-Triggered Multiagent Optimization for Two-Layered Model of Hybrid Energy System With Price Bidding-Based Demand Response.

IEEE transactions on cybernetics·2019
Same journal

Role of incretins in Alzheimer's disease and their impact on behavioral aspects.

Chinese medical journal·2026
Same journal

Asparagine-linked glycosyltransferase 6 deficiency suppresses tumorigenesis in clear cell renal cell carcinoma by regulating apoptosis and immunotherapy sensitivity.

Chinese medical journal·2026
Same journal

Impact of joint effect of parental anthropometrics on fetal growth trajectories: A national multicenter cohort study.

Chinese medical journal·2026
Same journal

Novel treatments for Parkinson's disease.

Chinese medical journal·2026
Same journal

Incidence, DALYs, and death rates of chronic kidney disease in China, 1990-2023: Results from the Global Burden of Disease Study 2023.

Chinese medical journal·2026
Same journal

Reclassifying renal cell carcinoma with pathologic lymph node metastasis: An approach to refining prognostic stratification in patients with AJCC III and IV stages.

Chinese medical journal·2026
See all related articles

Related Experiment Video

Updated: Dec 20, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K

Identification of differential gene expression for microarray data using recursive random forest.

Xiao-yan Wu1, Zhen-yu Wu, Kang Li

  • 1Department of Biostatistics, College of Public Health, Harbin Medical University, Harbin, Heilongjiang, China.

Chinese Medical Journal
|February 4, 2009
PubMed
Summary
This summary is machine-generated.

Recursive random forest effectively selects key genes from high-dimensional DNA microarray data. This method improves classification accuracy and identifies disease-associated genes for better clinical diagnosis and research.

More Related Videos

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

3.9K
Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2
10:10

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2

Published on: September 18, 2021

40.5K

Related Experiment Videos

Last Updated: Dec 20, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

3.9K
Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2
10:10

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2

Published on: September 18, 2021

40.5K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • DNA microarray data presents challenges due to high dimensionality and complex structures.
  • Traditional random forest methods struggle with optimal gene selection due to undifferentiated genes.
  • Effective gene selection is crucial for accurate analysis of high-dimensional biological data.

Purpose of the Study:

  • To introduce and validate recursive random forest analysis for gene selection in DNA microarray data.
  • To demonstrate the effectiveness of recursive random forest in improving classification accuracy.
  • To identify biologically relevant genes associated with diseases from microarray datasets.

Main Methods:

  • Recursive random forest, an enhanced random forest algorithm, iteratively removes non-influential genes.
  • Gene importance is assessed, and classification performance is evaluated using the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve.
  • The method was applied to five diverse microarray datasets: colon, prostate, leukemia, breast, and skin.

Main Results:

  • Recursive random forest achieved superior classification results with a reduced set of selected genes across all analyzed datasets.
  • Biological validation using NCBI confirmed the relevance of selected genes for breast and skin cancer data.
  • The method effectively retained genes crucial for disease association.

Conclusions:

  • Recursive random forest is a powerful tool for DNA microarray data analysis and gene selection.
  • The identified genes provide valuable insights for clinical diagnostics and understanding disease mechanisms.
  • This approach enhances the interpretability and utility of high-dimensional genomic data.