Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Outliers and Influential Points01:08

Outliers and Influential Points

4.1K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
4.1K
Vector Algebra: Graphical Method01:10

Vector Algebra: Graphical Method

12.2K
Vectors can be multiplied by scalars, added to other vectors, or subtracted from other vectors. The vector sum of two (or more) vectors is called the resultant vector or, for short, the resultant.
We use the laws of geometry to construct resultant vectors, followed by trigonometry to find vector magnitudes and directions. For a geometric construction of the sum of two vectors in a plane, we follow the parallelogram rule. Suppose two vectors are at arbitrary positions. Translate either one of...
12.2K
Scatter Plot01:15

Scatter Plot

7.0K
The most common and easiest way to display the relationship between two variables, x and y, is a scatter plot. A scatter plot shows the direction of a relationship between the variables. A clear direction happens when there is either:
7.0K
Plotting of Topographic Maps01:29

Plotting of Topographic Maps

62
Topographic maps represent the Earth's surface features using contour lines, which connect points of equal elevation to create a two-dimensional representation of three-dimensional terrain. Creating a topographic map requires a systematic approach.Begin by plotting a scaled grid and marking intersections corresponding to the survey's elevation data points. Assign elevation values at these intersections to build the base map. Next, determine contour levels using a consistent contour interval,...
62
Survival Tree01:19

Survival Tree

111
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
111
Time-Series Graph00:54

Time-Series Graph

4.4K
A time-series graph is a line graph with repeated measurements taken at successive intervals of time. It is also called a time series chart. To construct a time-series graph, one must look at both pieces of a paired data set. The horizontal axis is used to plot the time increments, and the vertical axis is used to plot the values of the variable that one is measuring. By using the axes in this way, each point on the graph will correspond to time and a measured quantity. The points on the graph...
4.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

HarveST uses a heterogeneous graph learning framework to reveal spatial transcriptomics patterns.

Communications biology·2026
Same author

Caffeine Sodium Benzoate-Induced Macrophage M1 Polarization Promotes Endothelial Progenitor Cell Dysfunction by Inducing Mitochondrial Dysfunction Via Targeting let-7a-5p/OPA1 Axis.

Applied biochemistry and biotechnology·2026
Same author

FAST: Scalable Factor Analysis for Spatial Dimension Reduction of Multi-section Spatial Transcriptomics.

Genomics, proteomics & bioinformatics·2026
Same author

Bioactive binary Schiff-base hydrogel from chitosan and functional PEGylated dialdehydes: Synthesis and structure-properties correlation.

Carbohydrate polymers·2025
Same author

Network-Guided Sparse Subspace Clustering on Single-Cell Data.

Journal of computational biology : a journal of computational molecular cell biology·2025
Same author

Nonlinear embedding and integration of omics data: a fast and tuning-free approach.

Briefings in bioinformatics·2025
Same journal

Correction: Reis et al. Bioinks Enriched with ECM Components Obtained by Supercritical Extraction. <i>Biomolecules</i> 2022, <i>12</i>, 394.

Biomolecules·2026
Same journal

Correction: Kim, K.-H.; Yoo, B.C. Gintonin as a Lysophosphatidic Acid-Enriched GPCR Ligand System: Molecular Architecture and Receptor Pharmacology in <i>Panax ginseng</i>. <i>Biomolecules</i> 2026, <i>16</i>, 465.

Biomolecules·2026
Same journal

Correction: Bastyte et al. The Association of Vitamin D Receptor Gene Polymorphisms with Vitamin D, Total IgE, and Blood Eosinophils in Patients with Atopy. <i>Biomolecules</i> 2024, <i>14</i>, 212.

Biomolecules·2026
Same journal

AtHSPR Plays a Positive Role in Arabidopsis Resistance Against <i>Pseudomonas syringae</i> pv. <i>tomato</i> DC3000 by Interacting with TOP1.

Biomolecules·2026
Same journal

CYTH4 Facilitates Renal Cell Carcinoma via Enhancing Proliferation and Likely Immune Evasion.

Biomolecules·2026
Same journal

Integrated Immune-Gut Profiling Identifies an Exploratory Pediatric Inflammatory Intestinal Profile Associated with Food-Specific IgG Reactivity.

Biomolecules·2026
See all related articles

Related Experiment Video

Updated: Jul 21, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

Graph Random Forest: A Graph Embedded Algorithm for Identifying Highly Connected Important Features.

Leqi Tian1,2, Wenbin Wu1, Tianwei Yu1,2,3

  • 1School of Data Science, The Chinese University of Hong Kong, Shenzhen 518172, China.

Biomolecules
|July 29, 2023
PubMed
Summary
This summary is machine-generated.

Graph Random Forest (GRF) improves gene selection in biological data by integrating network information. This method identifies functionally connected important genes, enhancing interpretability while maintaining classification accuracy.

Keywords:
feature selectiongene networkrandom forest

More Related Videos

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation
08:47

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation

Published on: February 9, 2024

1.5K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.5K

Related Experiment Videos

Last Updated: Jul 21, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation
08:47

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation

Published on: February 9, 2024

1.5K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.5K

Area of Science:

  • Bioinformatics
  • Machine Learning
  • Computational Biology

Background:

  • Random Forest (RF) is a robust machine learning algorithm effective for classification and regression, particularly in biological applications with limited sample sizes.
  • Gene expression datasets often present a high-dimensional challenge, with significantly more features (genes) than samples (p >> n).
  • Standard RF for feature selection can yield scattered important genes, contradicting the biological principle of functional gene network consistency.

Purpose of the Study:

  • To develop an enhanced feature selection method that incorporates biological network topology into the Random Forest algorithm.
  • To identify functionally connected important genes by leveraging known biological networks.
  • To improve the interpretability of feature selection in high-dimensional biological data.

Main Methods:

  • Introduction of Graph Random Forest (GRF), a novel algorithm that integrates external biological network information during forest construction.
  • GRF identifies important features that form highly connected sub-graphs within the biological network.
  • Validation through simulation experiments and application to real-world RNA-seq datasets (non-small cell lung cancer and human embryonic stem cells).

Main Results:

  • GRF achieves classification accuracy comparable to standard RF.
  • The selected important genes identified by GRF exhibit high connectivity within biological networks, forming interpretable sub-graphs.
  • The method demonstrates effectiveness in identifying biologically relevant features.

Conclusions:

  • Graph Random Forest (GRF) offers an effective approach for feature selection in biological data by incorporating network topology.
  • GRF enhances the biological interpretability of selected features through identification of connected gene sub-graphs.
  • The proposed method is a valuable addition to graph-based classification and feature selection techniques in bioinformatics.