Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein and Protein Structure02:15

Protein and Protein Structure

82.5K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
82.5K
Protein Families02:47

Protein Families

16.0K
Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key...
16.0K
Protein Networks02:26

Protein Networks

2.4K
2.4K
Protein and Protein Structures02:15

Protein and Protein Structures

11.7K
11.7K
Protein Folding01:22

Protein Folding

123.0K
Overview
123.0K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

12.9K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
12.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Description of phosphate hydrolysis reactions with the Self-Consistent-Charge Density-Functional-Tight-Binding (SCC-DFTB) theory. 1. Parameterization.

Journal of chemical theory and computation·2011
Same author

Protein subcellular multi-localization prediction using a min-max modular support vector machine.

International journal of neural systems·2010
Same author

[Regulatory functions of Pax gene family in Drosophila development].

Yi chuan = Hereditas·2010
Same author

Lymphoma endothelium preferentially expresses Tim-3 and facilitates the progression of lymphoma by mediating immune evasion.

The Journal of experimental medicine·2010
Same author

Distinguishing the viability of a single yeast cell with an ultra-sensitive radio frequency sensor.

Lab on a chip·2010
Same author

Controllable synthesis and luminescent properties of novel erythrocyte-like CaMoO4 hierarchical nanostructures via a simple surfactant-free hydrothermal route.

Dalton transactions (Cambridge, England : 2003)·2010
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Oct 4, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.1K

ProtPlat: an efficient pre-training platform for protein classification based on FastText.

Yuan Jin1, Yang Yang2

  • 1Department of Computer Science and Engineering, Shanghai Jiao Tong University, and Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai, 200240, China.

BMC Bioinformatics
|February 12, 2022
PubMed
Summary
This summary is machine-generated.

We developed ProtPlat, a novel pre-training platform for protein sequences, to improve machine learning predictions. This method enhances amino acid representations and boosts classification performance, especially for limited data scenarios.

Keywords:
Pre-trainingProtPlatProtein sequence classificationWeb server

More Related Videos

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.1K
Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

12.9K

Related Experiment Videos

Last Updated: Oct 4, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.1K
A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.1K
Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

12.9K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Machine Learning

Background:

  • Machine learning for protein sequence analysis is limited by insufficient labeled data.
  • Pre-training methods are successful in other fields but underexplored for protein sequences.

Purpose of the Study:

  • To develop a general pre-training platform for protein sequences to enhance feature representation.
  • To improve performance on sequence-based classification tasks, particularly with small datasets.

Main Methods:

  • ProtPlat platform utilizes the Pfam database for large-scale supervised pre-training.
  • A three-layer neural network and FastText model are employed for efficient learning.
  • The pre-trained model is fine-tuned on specific downstream task data.

Main Results:

  • ProtPlat learns effective amino acid representations and achieves efficient classification.
  • Experiments on three tasks (effector identification, localization, signal peptide recognition) show performance enhancement.
  • ProtPlat is competitive with state-of-the-art predictors, outperforming them on small datasets.

Conclusions:

  • ProtPlat effectively enhances feature representation for protein amino acid sequences.
  • The platform improves performance in sequence-based classification tasks.
  • ProtPlat is a valuable tool for protein sequence analysis, accessible as a public web service.