Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Disease- and chemotherapy-associated salivary microbiome changes in breast cancer patients.

Breast cancer research and treatment·2026
Same author

Νovel methylation biomarkers in liquid biopsy and classifying biosignatures for the clinical management of breast cancer.

Breast cancer research : BCR·2026
Same author

AutoML identification of microRNA biomarkers in high-risk pediatric acute lymphoblastic leukemia.

Non-coding RNA research·2025
Same author

MicroRNA Expression Analysis and Biological Pathways in Chemoresistant Non-Small Cell Lung Cancer.

Cancers·2025
Same author

Classification of non-TCGA cancer samples to TCGA molecular subtypes using compact feature sets.

Cancer cell·2025
Same author

BRCA1 & BRCA2 methylation as a prognostic and predictive biomarker in cancer: Implementation in liquid biopsy in the era of precision medicine.

Clinical epigenetics·2024
Same journal

circ2DGNN: circRNA-Disease Association Prediction via Transformer-Based Graph Neural Network.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Hierarchical Hypergraph Learning in Association- Weighted Heterogeneous Network for miRNA- Disease Association Identification.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Discriminative Domain Adaption Network for Simultaneously Removing Batch Effects and Annotating Cell Types in Single-Cell RNA-Seq.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

MLW-BFECF: A Multi-Weighted Dynamic Cascade Forest Based on Bilinear Feature Extraction for Predicting the Stage of Kidney Renal Clear Cell Carcinoma on Multi-Modal Gene Data.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Generative Biomedical Event Extraction With Constrained Decoding Strategy.

IEEE/ACM transactions on computational biology and bioinformatics·2024
See all related articles

Related Experiment Video

Updated: Dec 6, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

The γ-OMP Algorithm for Feature Selection With Application to Gene Expression Data.

Michail Tsagris, Zacharias Papadovasilakis, Kleanthi Lakiotaki

    IEEE/ACM Transactions on Computational Biology and Bioinformatics
    |October 9, 2020
    PubMed
    Summary
    This summary is machine-generated.

    We introduce gamma-OMP, a scalable feature selection algorithm for predictive analytics. It handles diverse data types and outperforms LASSO on gene expression datasets for classification and regression tasks.

    More Related Videos

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
    03:37

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

    Published on: March 1, 2024

    1.1K
    Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples
    07:30

    Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples

    Published on: June 8, 2020

    12.5K

    Related Experiment Videos

    Last Updated: Dec 6, 2025

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
    07:35

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

    Published on: October 11, 2018

    7.9K
    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
    03:37

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

    Published on: March 1, 2024

    1.1K
    Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples
    07:30

    Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples

    Published on: June 8, 2020

    12.5K

    Area of Science:

    • Bioinformatics
    • Computational Biology
    • Statistical Learning

    Background:

    • Feature selection is crucial for predictive analytics, especially with high-dimensional molecular data.
    • Existing algorithms like LASSO face scalability challenges with tens of thousands of features.

    Purpose of the Study:

    • To develop a generalized and scalable feature selection algorithm, gamma-OMP.
    • To enhance predictive analytics for diverse molecular data types.

    Main Methods:

    • Proposed gamma-OMP, a generalization of Orthogonal Matching Pursuit.
    • Enabled handling of various outcomes (continuous, binary, time-to-event), categorical features, and predictive models.
    • Compared gamma-OMP against LASSO using simulated and real gene expression datasets.

    Main Results:

    • Gamma-OMP demonstrated comparable or superior performance to LASSO.
    • Effectiveness shown across binary classification, regression, and time-to-event analyses.
    • Successful application in bioinformatics analysis settings.

    Conclusions:

    • Gamma-OMP offers a scalable and versatile solution for feature selection in high-dimensional data.
    • The algorithm's flexibility and effectiveness make it suitable for complex bioinformatics tasks.
    • Gamma-OMP is easy to implement and extend for future research.