Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Sparse component analysis: A method that uncovers separable computations within neural population activity.

Neuron·2026
Same author

Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling and cloud-native open-source tools.

Nature methods·2024
Same author

Identifying Interpretable Latent Factors with Sparse Component Analysis.

bioRxiv : the preprint server for biology·2024
Same author

A primer on the use of machine learning to distil knowledge from data in biological psychiatry.

Molecular psychiatry·2024
Same author

Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open-source tools.

bioRxiv : the preprint server for biology·2023
Same author

Seasonal Phenology and Climate Associated Feeding Activity of Introduced <i>Marchalina hellenica</i> in Southeast Australia.

Insects·2023
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Oct 21, 2025

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

6.0K

Learning sparse log-ratios for high-throughput sequencing data.

Elliott Gordon-Rodriguez1, Thomas P Quinn2, John P Cunningham1

  • 1Department of Statistics, Columbia University, New York, NY 10025, USA.

Bioinformatics (Oxford, England)
|September 9, 2021
PubMed
Summary
This summary is machine-generated.

We introduce CoDaCoRe, a deep learning algorithm for discovering sparse log-ratio biomarkers in high-throughput sequencing data. CoDaCoRe significantly accelerates biomarker discovery and achieves state-of-the-art performance.

More Related Videos

2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications
05:41

2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications

Published on: July 10, 2020

2.1K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.2K

Related Experiment Videos

Last Updated: Oct 21, 2025

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

6.0K
2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications
05:41

2D-HELS MS Seq: A General LC-MS-Based Method for Direct and de novo Sequencing of RNA Mixtures with Different Nucleotide Modifications

Published on: July 10, 2020

2.1K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.2K

Area of Science:

  • Bioinformatics
  • Machine Learning
  • Computational Biology

Background:

  • Discovering sparse biomarkers associated with outcomes is crucial in bioinformatics.
  • Log-ratio biomarkers are important for compositional data (CoDa), including high-throughput sequencing (HTS) data.
  • Identifying these biomarkers is computationally challenging due to combinatorial optimization problems, limiting current methods' scalability.

Purpose of the Study:

  • To present CoDaCoRe, a novel deep learning algorithm for identifying sparse, interpretable, and predictive log-ratio biomarkers.
  • To address the computational challenges and scalability limitations of existing methods for biomarker discovery in HTS and CoDa.

Main Methods:

  • CoDaCoRe employs a continuous relaxation technique to approximate the combinatorial optimization problem.
  • The relaxation is efficiently optimized using gradient descent and the modern machine learning (ML) toolbox.
  • The algorithm is designed to handle high-dimensional datasets.

Main Results:

  • CoDaCoRe runs orders of magnitude faster than existing methods.
  • The algorithm achieves state-of-the-art performance in predictive accuracy and biomarker sparsity.
  • Outperformance is validated across diverse datasets, including challenging high-dimensional microbiome, metabolite, and microRNA data.

Conclusions:

  • CoDaCoRe offers a computationally efficient and highly accurate solution for log-ratio biomarker discovery.
  • The method overcomes scalability limitations, enabling analysis of previously intractable high-dimensional datasets.
  • CoDaCoRe represents a significant advancement in the automated discovery of sparse biomarkers from complex biological data.