Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Conservation of Protein Domains Over Different Proteins

Conservation of Protein Domains Over Different Proteins

Protein domains are small structurally independent units that are part of a single amino acid chain. Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...

Conservation of Protein Domains

Conservation of Protein Domains

Three-Domain System of Life

Three-Domain System of Life

Ribosomal RNA (rRNA) sequence analysis revealed three distinct groups of cells: eukaryotes, bacteria, and archaea. In 1978, Carl R. Woese proposed the concept of domains, a taxonomic level above kingdoms, to differentiate these groups. He suggested that archaea and bacteria, despite their similar appearance, represent separate domains. Domains differ in rRNA, membrane lipid structure, transfer RNA, and antibiotic sensitivity.In this classification, animals, plants, and fungi belong to the...

Genome Annotation and Assembly

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The association of different dimensions of anhedonia in the relationship between depressive symptoms and self-harm in adolescents with mood disorders.

Frontiers in psychiatry·2026

Same author

Temperature variability and mortality risk: distinguishing intraday and interday effects and quantifying the attributable mortality burden in Chengdu, Southwest China.

Frontiers in public health·2026

Same author

Hematological biomarkers for predicting pathologic response to neoadjuvant immunochemotherapy and cycle optimization in locally advanced gastric cancer.

Frontiers in immunology·2026

Same author

Linkage-aware inference of fitness from short-read time-series genomic data.

Virus evolution·2026

Same author

Return to baseline arsenic concentrations after 1 year on gluten-free diet in children with celiac disease: A prospective cohort study.

JPGN reports·2026

Same author

A physically constrained and interpretable deep learning framework for PM<sub>2.5</sub> inversion under sparse monitoring conditions in arid regions.

Environmental pollution (Barking, Essex : 1987)·2026

Same journal

Metabolic reprogramming, oxidative stress, and mitophagy in JSRV Env-transformed BEAS-2B cells: insights from integrated transcriptomics and metabolomics.

BMC genomics·2026

Same journal

Integrated multi-population selection-signature scans identify putative functional genes influencing litter size in jishen black pigs.

BMC genomics·2026

Same journal

Evaluation of Dorado v5.2.0 de novo basecalling models for the detection of tRNA modifications using RNA004 chemistry.

BMC genomics·2026

Same journal

Genomeic and transcriptomic analysis suggests potential regulatory relationships among MC1R, TUBB3, and PMEL associated with black-flecked plumage in chickens.

BMC genomics·2026

Same journal

Gene co-expression networks related to intramuscular fatty acid composition across different pig genotypes.

BMC genomics·2026

Same journal

Honghe Bunya-like virus: a novel virus identified in mosquitoes from Yunnan, China.

BMC genomics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 9, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

Improving protein domain classification for third-generation sequencing reads using deep learning.

Nan Du¹, Jiayu Shang², Yanni Sun³

¹Computer Science and Engineering, Michigan State University, East Lansing, 48824, USA.

|April 10, 2021

Summary

This summary is machine-generated.

ProDOMA is a novel deep learning tool for protein domain classification in long, noisy DNA reads from third-generation sequencing (TGS). It accurately identifies protein domains without needing error correction or assembly.

More Related Videos

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing

Published on: October 3, 2018

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

Related Experiment Videos

Last Updated: Nov 9, 2025

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing

Interactome-Seq: A Protocol for Domainome Library Construction, Validation and Selection by Phage Display and Next Generation Sequencing

Published on: October 3, 2018

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

Published on: May 10, 2024

Area of Science:

Genomics
Bioinformatics
Computational Biology

Background:

Third-generation sequencing (TGS) yields long DNA reads (10s to 100s of kb).
TGS enables protein domain annotation without assembly, offering biological insights.
High error rates in TGS data challenge existing domain analysis methods, reducing accuracy.

Purpose of the Study:

To develop a computational method for accurate protein domain prediction in long, noisy TGS reads.
To address the limitations of current domain analysis pipelines with TGS data.

Main Methods:

Introduction of ProDOMA, a deep learning model for protein domain classification.
Utilizes deep neural networks with 3-frame translation encoding to capture conserved features.
Formulates the problem as an open-set to enable rejection of reads lacking targeted domains.

Main Results:

ProDOMA demonstrates superior performance in protein domain classification compared to HMMER and DeepFam.
Experiments on simulated and real TGS human genome data validate ProDOMA's effectiveness.
The model successfully classifies domains in long, noisy reads.

Conclusions:

ProDOMA is an effective end-to-end tool for protein domain analysis on long, noisy reads.
The tool operates without the necessity of prior error correction.
ProDOMA enhances the utility of TGS data for biological function studies.