Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.1K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.1K
Genomics02:02

Genomics

36.9K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
36.9K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

99
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
99
Next-generation Sequencing03:00

Next-generation Sequencing

92.3K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
92.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Safety, Pharmacokinetics, and Pharmacodynamics of SHR6508, a Calcium-Sensing Receptor Agonist, in Maintenance Hemodialysis Patients With Secondary Hyperparathyroidism: A Multicenter, Randomized, Double-Blind, Placebo-Controlled Phase 1 Study.

Kidney medicine·2026
Same author

Seasonal phenology and spatiotemporal persistence of fecal coliform contamination in shellfish-harvesting waters of Canada's Pacific and Atlantic coasts.

Environmental pollution (Barking, Essex : 1987)·2026
Same author

PANA-Surv: A Pathway-Guided Adaptive Neighborhood Augmentation Framework Using KEGG Pathways for Multi-Omics Cancer Prognosis.

Genes·2026
Same author

Mitochondrial-derived peptide MOTS-c activates metabolic signaling but blunts reparative function in human mesenchymal stromal cells.

Inflammation and regeneration·2026
Same author

Large scale plasma proteomics: opening up new frontiers of biomarker and therapeutic target discoveries in COPD.

The European respiratory journal·2026
Same author

Laparoscopic proximal gastrectomy with esophagogastrostomy using overlap method combined with gastric remnant "U"-shaped fold: a retrospective cohort study.

Frontiers in surgery·2026
Same journal

Transcriptomic analysis reveals FcγR-mediated phagocytosis as a key pathway for the anti-inflammatory action of <i>Polygonatum sibiricum</i> polysaccharides in loach.

Frontiers in genetics·2026
Same journal

A novel <i>ABO</i> splice site variant underlying the A<sub>3</sub> phenotype: immunogenetic basis and functional dissection.

Frontiers in genetics·2026
Same journal

Case Report: Identification of two novel <i>ALMS1</i> variants in a patient with a ciliopathy resembling Alström syndrome.

Frontiers in genetics·2026
Same journal

Integrative analysis identifies Hspa5 as a key regulator of the ERS/UPR-immune axis in spinal cord injury.

Frontiers in genetics·2026
Same journal

Evaluation of genomic selection to improve survival of eastern oysters infected with <i>Perkinsus marinus</i>.

Frontiers in genetics·2026
Same journal

A rescue assay for genetic diagnosis of oculocutaneous albinism using melanocytic MNT1 knock-out cells.

Frontiers in genetics·2026
See all related articles

Related Experiment Video

Updated: Aug 26, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.1K

Deep learning methods may not outperform other machine learning methods on analyzing genomic studies.

Yao Dong1,2,3, Shaoze Zhou2, Li Xing4

  • 1School of Artifcial Intelligence, Hebei University of Technology, Tianjin, China.

Frontiers in Genetics
|October 10, 2022
PubMed
Summary
This summary is machine-generated.

Deep learning (DL) methods often do not outperform traditional machine learning (ML) for genomic data analysis, even with large sample sizes. Non-deep ML models are recommended for genomic studies due to comparable or superior performance.

Keywords:
deep learningdisease predictiongenomic analysishit curveimbalance datamachine learning

More Related Videos

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

426
Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.2K

Related Experiment Videos

Last Updated: Aug 26, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.1K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

426
Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.2K

Area of Science:

  • Genomics
  • Biomedical Data Science
  • Machine Learning

Background:

  • Deep learning (DL) shows success in biomedical image processing but faces challenges with genomic data due to small sample sizes and lack of common structural patterns.
  • Genomic datasets often lack the typical structures (e.g., convolutional layers) that benefit pre-trained DL models.
  • Over-reliance on DL for genomic analysis necessitates a comparative evaluation against established non-deep machine learning (ML) methods.

Purpose of the Study:

  • To benchmark the performance of deep learning (DL) versus non-deep machine learning (ML) methods for analyzing genomic data across various sample sizes.
  • To assess the predictive accuracy of different models for three lung diseases (asthma, COPD, lung cancer) using UK Biobank data.
  • To provide guidance on the appropriate use of DL in genomic studies, considering sample size limitations.

Main Methods:

  • A benchmark study was conducted using UK Biobank data, including comprehensive genomic information (millions of Single-Nucleotide Polymorphisms) and patient characteristics.
  • Five prediction models were evaluated: three non-deep ML (Elastic Net, XGBoost, SVM) and two DL (DNN, LSTM).
  • Performance was assessed using standard metrics like F1-score and the hit curve for rare event prediction.

Main Results:

  • Deep learning (DL) methods frequently underperformed non-deep machine learning (ML) models in analyzing genomic data, even with datasets exceeding 200,000 samples.
  • Performance differences between DL and non-deep ML diminished as sample size increased.
  • The study suggests that DL methods may offer benefits only with genomic datasets significantly larger than those analyzed.

Conclusions:

  • Non-deep machine learning (ML) methods are often more suitable and perform comparably or better than deep learning (DL) for current genomic data analysis.
  • The findings caution against the overuse of DL in genomic studies, even with biobank-scale data.
  • Increasing sample sizes may enhance DL performance in genomics, but current evidence favors established ML approaches for typical genomic datasets.