Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling and cloud-native open-source tools.

Nature methods·2024
Same author

Identifying Interpretable Latent Factors with Sparse Component Analysis.

bioRxiv : the preprint server for biology·2024
Same author

A primer on the use of machine learning to distil knowledge from data in biological psychiatry.

Molecular psychiatry·2024
Same author

Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open-source tools.

bioRxiv : the preprint server for biology·2023
Same author

Seasonal Phenology and Climate Associated Feeding Activity of Introduced <i>Marchalina hellenica</i> in Southeast Australia.

Insects·2023
Same author

Flexible neural control of motor units.

Nature neuroscience·2022
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Oct 21, 2025

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

6.0K

Learning sparse log-ratios for high-throughput sequencing data.

Elliott Gordon-Rodriguez1, Thomas P Quinn2, John P Cunningham1

  • 1Department of Statistics, Columbia University, New York, NY 10025, USA.

Bioinformatics (Oxford, England)
|September 9, 2021
PubMed
Summary
This summary is machine-generated.

We introduce CoDaCoRe, a deep learning algorithm for discovering sparse log-ratio biomarkers in high-throughput sequencing data. CoDaCoRe significantly accelerates biomarker discovery and achieves state-of-the-art performance.

More Related Videos

2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications
05:41

2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications

Published on: July 10, 2020

2.1K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.2K

Related Experiment Videos

Last Updated: Oct 21, 2025

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

6.0K
2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications
05:41

2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications

Published on: July 10, 2020

2.1K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.2K

Area of Science:

  • Bioinformatics
  • Machine Learning
  • Computational Biology

Background:

  • Discovering sparse biomarkers associated with outcomes is crucial in bioinformatics.
  • Log-ratio biomarkers are important for compositional data (CoDa), including high-throughput sequencing (HTS) data.
  • Identifying these biomarkers is computationally challenging due to combinatorial optimization problems, limiting current methods' scalability.

Purpose of the Study:

  • To present CoDaCoRe, a novel deep learning algorithm for identifying sparse, interpretable, and predictive log-ratio biomarkers.
  • To address the computational challenges and scalability limitations of existing methods for biomarker discovery in HTS and CoDa.

Main Methods:

  • CoDaCoRe employs a continuous relaxation technique to approximate the combinatorial optimization problem.
  • The relaxation is efficiently optimized using gradient descent and the modern machine learning (ML) toolbox.
  • The algorithm is designed to handle high-dimensional datasets.

Main Results:

  • CoDaCoRe runs orders of magnitude faster than existing methods.
  • The algorithm achieves state-of-the-art performance in predictive accuracy and biomarker sparsity.
  • Outperformance is validated across diverse datasets, including challenging high-dimensional microbiome, metabolite, and microRNA data.

Conclusions:

  • CoDaCoRe offers a computationally efficient and highly accurate solution for log-ratio biomarker discovery.
  • The method overcomes scalability limitations, enabling analysis of previously intractable high-dimensional datasets.
  • CoDaCoRe represents a significant advancement in the automated discovery of sparse biomarkers from complex biological data.