Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

tRNA Activation02:26

tRNA Activation

22.1K
Aminoacyl-tRNA synthetases are present in both eukaryotes and bacteria. Though eukaryotes have 20 different aminoacyl-tRNA synthetases to couple to 20 amino acids, many bacteria do not have genes for all of these aminoacyl-tRNA synthetases. Despite this, they still use all 20 amino acids to synthesize their proteins. For instance, some bacteria do not have the gene encoding the enzyme that couples glutamine with its partner tRNA. In these organisms, one enzyme adds glutamic acid to all of the...
22.1K
tRNA Activation02:26

tRNA Activation

8.1K
8.1K
Amino acids03:42

Amino acids

103.0K
Amino acids are the monomers that comprise proteins. Each amino acid has the same fundamental structure, which consists of a central carbon atom, or the alpha (α) carbon, bonded to an amino group (NH2), a carboxyl group (COOH), and to a hydrogen atom. Every amino acid also has another atom or group of atoms bonded to the central atom known as the R group. There are 20 common amino acids present in proteins, each with a different R group. Variation in the amino acid sequence is responsible for...
103.0K
From DNA to Protein03:06

From DNA to Protein

21.6K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
21.6K
Amino Acid Biosynthetic Pathways01:29

Amino Acid Biosynthetic Pathways

747
Amino acid biosynthesis is essential for cell growth, protein synthesis, and metabolic regulation. Cells generate essential and non-essential amino acids from metabolic intermediates to sustain vital biological functions. These intermediates originate from key metabolic pathways: glycolysis, the tricarboxylic acid (TCA) cycle, and the pentose phosphate pathway. Important precursors include α-ketoglutarate, pyruvate, oxaloacetate, phosphoenolpyruvate, and erythrose-4-phosphate, which...
747
Long-term Potentiation01:25

Long-term Potentiation

3.2K
Long-term potentiation, or LTP, is one of the ways by which synaptic plasticity—changes in the strength of chemical synapses—can occur in the brain. LTP is the process of synaptic strengthening that occurs over time between pre and postsynaptic neuronal connections. The synaptic strengthening of LTP works in opposition to the synaptic weakening of long-term depression (LTD) and together are the main mechanisms that underlie learning and memory.
Hebbian LTP
LTP can occur when...
3.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Whole metagenome sequencing: not deep enough for complete microbial function recovery.

Microbiome·2026
Same author

TOFU-MAaPO: fast, scalable and reproducible analysis of large metagenome sequence data from the Sequence Read Archive.

Nature communications·2026
Same author

Potential of Clinical Care-Collected Gut Biopsies for Advancing Personalised Medicine in Inflammatory Bowel Disease.

United European gastroenterology journal·2026
Same author

Prolonged Inflammation Associates With Greater Infarct Size and Poor Outcome After ST-Segment Elevation Myocardial Infarction.

JACC. Basic to translational science·2026
Same author

Large-scale analysis of temporal gene expression variation in peripheral blood.

Nature communications·2026
Same author

Exome sequencing directly implicates 68 genes in inflammatory bowel disease.

medRxiv : the preprint server for health sciences·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Dec 18, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.6K

Amino acid encoding for deep learning applications.

Hesham ElAbd1, Yana Bromberg2,3,4, Adrienne Hoarfrost2

  • 1Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany.

BMC Bioinformatics
|June 11, 2020
PubMed
Summary
This summary is machine-generated.

End-to-end learning of amino acid embeddings matches classical methods, even with limited data. This deep learning approach may reduce embedding dimensions for efficient model deployment.

Keywords:
Amino acid encodingAmino acids embeddingConvoluted-neural network (CNN)Deep-learningHLA-II peptide interactionHuman-leukocyte antigen (HLA)Machine-learning (ML)Protein-protein interaction (PPI)Recurrent neural network (RNN)

More Related Videos

Extraction of Non-Protein Amino Acids from Cyanobacteria for Liquid Chromatography-Tandem Mass Spectrometry Analysis
08:14

Extraction of Non-Protein Amino Acids from Cyanobacteria for Liquid Chromatography-Tandem Mass Spectrometry Analysis

Published on: December 9, 2022

4.1K
Decoding Natural Behavior from Neuroethological Embedding
08:00

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

457

Related Experiment Videos

Last Updated: Dec 18, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.6K
Extraction of Non-Protein Amino Acids from Cyanobacteria for Liquid Chromatography-Tandem Mass Spectrometry Analysis
08:14

Extraction of Non-Protein Amino Acids from Cyanobacteria for Liquid Chromatography-Tandem Mass Spectrometry Analysis

Published on: December 9, 2022

4.1K
Decoding Natural Behavior from Neuroethological Embedding
08:00

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

457

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Machine Learning

Background:

  • Deep learning excels in bioinformatics, often outperforming classical methods with large datasets.
  • Amino acids are typically represented by continuous vectors via embedding matrices in deep learning.
  • End-to-end learning optimizes embeddings directly from data, yielding state-of-the-art results.

Purpose of the Study:

  • Systematically evaluate end-to-end learning for single amino acid encoding.
  • Compare end-to-end learned embeddings against classical encoding strategies (one-hot, VHSE8, BLOSUM62).
  • Assess performance across different deep learning architectures (RNN, CNN, CNN-RNN).

Main Methods:

  • Implemented and compared classical encoding matrices with end-to-end learned amino acid embeddings.
  • Utilized recurrent neural networks (RNN), convolutional neural networks (CNN), and hybrid CNN-RNN architectures.
  • Evaluated performance on two distinct bioinformatics prediction tasks.

Main Results:

  • End-to-end learning achieved performance comparable to classical encodings, even with limited training data.
  • A reduction in embedding dimension was possible without compromising performance, crucial for resource-constrained devices.
  • Embedding dimension significantly impacts model performance; deep learning models can learn from random vectors.

Conclusions:

  • End-to-end learning offers a flexible and potent method for amino acid encoding in bioinformatics.
  • Benchmarking encoding schemes against random vectors is essential to differentiate information content from distinguishability effects.