Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

13.5K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
13.5K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.5K
3.5K
Three-Domain System of Life01:21

Three-Domain System of Life

445
Ribosomal RNA (rRNA) sequence analysis revealed three distinct groups of cells: eukaryotes, bacteria, and archaea. In 1978, Carl R. Woese proposed the concept of domains, a taxonomic level above kingdoms, to differentiate these groups. He suggested that archaea and bacteria, despite their similar appearance, represent separate domains. Domains differ in rRNA, membrane lipid structure, transfer RNA, and antibiotic sensitivity.In this classification, animals, plants, and fungi belong to the...
445
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.8K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.8K
Improving Translational Accuracy02:07

Improving Translational Accuracy

12.1K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
12.1K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.2K
3.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The association of different dimensions of anhedonia in the relationship between depressive symptoms and self-harm in adolescents with mood disorders.

Frontiers in psychiatry·2026
Same author

Temperature variability and mortality risk: distinguishing intraday and interday effects and quantifying the attributable mortality burden in Chengdu, Southwest China.

Frontiers in public health·2026
Same author

Hematological biomarkers for predicting pathologic response to neoadjuvant immunochemotherapy and cycle optimization in locally advanced gastric cancer.

Frontiers in immunology·2026
Same author

Linkage-aware inference of fitness from short-read time-series genomic data.

Virus evolution·2026
Same author

Return to baseline arsenic concentrations after 1 year on gluten-free diet in children with celiac disease: A prospective cohort study.

JPGN reports·2026
Same author

A physically constrained and interpretable deep learning framework for PM<sub>2.5</sub> inversion under sparse monitoring conditions in arid regions.

Environmental pollution (Barking, Essex : 1987)·2026
Same journal

Metabolic reprogramming, oxidative stress, and mitophagy in JSRV Env-transformed BEAS-2B cells: insights from integrated transcriptomics and metabolomics.

BMC genomics·2026
Same journal

Integrated multi-population selection-signature scans identify putative functional genes influencing litter size in jishen black pigs.

BMC genomics·2026
Same journal

Evaluation of Dorado v5.2.0 de novo basecalling models for the detection of tRNA modifications using RNA004 chemistry.

BMC genomics·2026
Same journal

Genomeic and transcriptomic analysis suggests potential regulatory relationships among MC1R, TUBB3, and PMEL associated with black-flecked plumage in chickens.

BMC genomics·2026
Same journal

Gene co-expression networks related to intramuscular fatty acid composition across different pig genotypes.

BMC genomics·2026
Same journal

Honghe Bunya-like virus: a novel virus identified in mosquitoes from Yunnan, China.

BMC genomics·2026
See all related articles

Related Experiment Video

Updated: Nov 9, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.3K

Improving protein domain classification for third-generation sequencing reads using deep learning.

Nan Du1, Jiayu Shang2, Yanni Sun3

  • 1Computer Science and Engineering, Michigan State University, East Lansing, 48824, USA.

BMC Genomics
|April 10, 2021
PubMed
Summary
This summary is machine-generated.

ProDOMA is a novel deep learning tool for protein domain classification in long, noisy DNA reads from third-generation sequencing (TGS). It accurately identifies protein domains without needing error correction or assembly.

More Related Videos

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing
12:04

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing

Published on: October 3, 2018

9.2K
DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
04:17

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

1.1K

Related Experiment Videos

Last Updated: Nov 9, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.3K
Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing
12:04

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing

Published on: October 3, 2018

9.2K
DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
04:17

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

1.1K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Third-generation sequencing (TGS) yields long DNA reads (10s to 100s of kb).
  • TGS enables protein domain annotation without assembly, offering biological insights.
  • High error rates in TGS data challenge existing domain analysis methods, reducing accuracy.

Purpose of the Study:

  • To develop a computational method for accurate protein domain prediction in long, noisy TGS reads.
  • To address the limitations of current domain analysis pipelines with TGS data.

Main Methods:

  • Introduction of ProDOMA, a deep learning model for protein domain classification.
  • Utilizes deep neural networks with 3-frame translation encoding to capture conserved features.
  • Formulates the problem as an open-set to enable rejection of reads lacking targeted domains.

Main Results:

  • ProDOMA demonstrates superior performance in protein domain classification compared to HMMER and DeepFam.
  • Experiments on simulated and real TGS human genome data validate ProDOMA's effectiveness.
  • The model successfully classifies domains in long, noisy reads.

Conclusions:

  • ProDOMA is an effective end-to-end tool for protein domain analysis on long, noisy reads.
  • The tool operates without the necessity of prior error correction.
  • ProDOMA enhances the utility of TGS data for biological function studies.