Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

13.9K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
13.9K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.9K
3.9K
Protein Families02:47

Protein Families

16.5K
Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key...
16.5K
Protein Families02:47

Protein Families

4.1K
4.1K
Protein Organization01:24

Protein Organization

8.9K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
8.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

[AI-Assisted ECG diagnostics : Classical test statistics still apply].

Herzschrittmachertherapie & Elektrophysiologie·2026
Same author

Multi-institutional classification of fibroglandular tissue and background parenchymal enhancement in breast MRI using deep learning.

Journal of medical imaging (Bellingham, Wash.)·2026
Same author

Early planting and early maturing sunflower hybrids are neglected management tools to remedy outbreaks of the red sunflower seed weevil (Coleoptera: Curculionidae).

Environmental entomology·2026
Same author

Multimodal Deep Learning for Prediction of Progression-Free Survival in Patients with Neuroendocrine Tumors Undergoing <sup>177</sup>Lu-Based Peptide Receptor Radionuclide Therapy.

Cancers·2026
Same author

MIMIC-III-Ext-PPG, a PPG-based Benchmark Dataset for Cardiovascular and Respiratory Signal Analysis.

Scientific data·2026
Same author

Detection of calcified plaques: comparison between coronary CT angiography and thin-slice non-contrast CT with deep learning-aided image registration.

European radiology·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Dec 31, 2025

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K

UDSMProt: universal deep sequence models for protein classification.

Nils Strodthoff1, Patrick Wagner1, Markus Wenzel1

  • 1Department of Video Coding & Analytics, Fraunhofer Heinrich Hertz Institute, Berlin 10587, Germany.

Bioinformatics (Oxford, England)
|January 9, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a universal deep sequence model for protein classification, outperforming existing methods by leveraging self-supervised learning. This approach infers protein properties directly from amino acid sequences, advancing bioinformatics and omics research.

More Related Videos

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.6K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.4K

Related Experiment Videos

Last Updated: Dec 31, 2025

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.6K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.4K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Protein property inference from amino acid sequences is a fundamental bioinformatics challenge.
  • Current methods often rely on task-specific, handcrafted features and extensive database searches.
  • There is a need for more efficient and generalizable approaches to protein classification.

Purpose of the Study:

  • To develop a universal deep sequence model for protein classification.
  • To demonstrate that a task-agnostic representation learned via self-supervised language modeling can be effectively transferred to specific tasks.
  • To evaluate the model's performance on enzyme class prediction, gene ontology prediction, and remote homology/fold detection.

Main Methods:

  • Pre-training a deep sequence model on unlabeled protein sequences from Swiss-Prot using self-supervised language modeling.
  • Fine-tuning the pre-trained model on downstream protein classification tasks.
  • Applying the model to enzyme class prediction, gene ontology prediction, and remote homology/fold detection.

Main Results:

  • The universal deep sequence model achieves performance on par with or superior to state-of-the-art methods on three distinct protein classification tasks.
  • The model successfully infers protein properties directly from amino acid sequences.
  • The study highlights the potential of natural language processing methods in omics data analysis.

Conclusions:

  • A universal deep sequence model pre-trained with self-supervised learning offers a powerful and efficient approach to protein classification.
  • This method demonstrates the feasibility of inferring protein properties solely from sequence data.
  • The findings underscore the broad applicability of modern natural language processing techniques in biological sequence analysis.