Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
Evolution of Microbial Genome01:08

Evolution of Microbial Genome

Microbial genome evolution is a highly dynamic process shaped by continual gene gain and loss across species and strains. This genomic flexibility allows microorganisms to adapt rapidly to environmental pressures and interactions with other organisms. Central to understanding this diversity is the distinction between the core and pan genomes.The core genome comprises the genes shared by all sampled strains of a species, representing essential functions needed for fundamental cellular processes.
Conservation of Protein Domains02:26

Conservation of Protein Domains

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
From DNA to Protein03:06

From DNA to Protein

The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
Evolutionary Processes in Microbes01:26

Evolutionary Processes in Microbes

Microbial evolution occurs rapidly due to short generation times and a variety of genetic processes, including horizontal gene transfer, mutation, recombination, and genetic drift. These mechanisms collectively enable microbes to adapt swiftly to changing environments.Horizontal gene transfer (HGT) allows genes to move between different species and occurs through three main mechanisms: conjugation, transformation, and transduction. Conjugation involves direct cell-to-cell contact for DNA...
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ERGA-BGE reference genome of <i>Diadema setosum:</i> the Black Longspine Urchin invading the Mediterranean sea.

Open research Europe·2026
Same author

ERGA-BGE genome of <i>Dailognatha quadricollis,</i> an East Mediterranean darkling beetle.

Open research Europe·2026
Same author

An automated decision-making procedure for ranking and selecting species in biodiversity projects.

npj biodiversity·2026
Same author

Ecotypes, <i>Wolbachia</i>, and urbanization shape <i>Culex pipiens</i> population structure in a West Nile virus hotspot.

bioRxiv : the preprint server for biology·2026
Same author

Genomic Evidence of Hybridization Between Two Species in Nicotiana sect. Alatae.

Annals of botany·2026
Same author

VICAST: An Integrated Toolkit for Viral Genome Annotation Curation and Low-Frequency Variant Analysis in Passage Studies.

bioRxiv : the preprint server for biology·2026
Same journal

Beyond housekeeping: snRNA diversity, regulation, and human disease.

Trends in genetics : TIG·2026
Same journal

Rethinking mitochondrial metabolism: Intraindividual variability meets population constraints.

Trends in genetics : TIG·2026
Same journal

A role for epigenetics in rapid adaptation.

Trends in genetics : TIG·2026
Same journal

The myth of asexual fungi.

Trends in genetics : TIG·2026
Same journal

Co-transcriptional splicing: Distinct phases, mutual benefits, and basis for nuclear architecture.

Trends in genetics : TIG·2026
Same journal

From mobile DNA to medicine: Fusion Conference highlights the therapeutic potential of endogenous retroelements in human disease.

Trends in genetics : TIG·2026
See all related articles
  1. Home
  2. Rethinking Molecular Evolution Through Protein Language Model Embeddings.
  1. Home
  2. Rethinking Molecular Evolution Through Protein Language Model Embeddings.

Related Experiment Video

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

Rethinking molecular evolution through protein language model embeddings.

Rosa Fernández1, Sergi Valverde1, Aureliano Bombarely2

  • 1Institute of Evolutionary Biology (CSIC-UPF), Barcelona, Spain.

Trends in Genetics : TIG
|June 14, 2026

View abstract on PubMed

Summary
This summary is machine-generated.

Protein language models create embeddings that capture evolutionary data. This enables new geometric analyses of protein relationships, merging molecular evolution with embedding techniques.

Keywords:
deep learningevolutionary bioinformaticsgenomicsmachine learningphylogeneticsprotein language models

More Related Videos

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

Related Experiment Videos

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

Area of Science:

  • Computational biology
  • Bioinformatics
  • Evolutionary biology

Background:

  • Protein language models (PLMs) generate high-dimensional embeddings from protein sequences.
  • These embeddings implicitly learn biochemical, structural, and functional constraints without supervised labels.

Purpose of the Study:

  • To demonstrate that protein embeddings encode significant evolutionary information.
  • To introduce novel geometry-based methods for analyzing protein evolution using these embeddings.
  • To advocate for integrating evolutionary embedding analysis with classical molecular evolution.

Main Methods:

  • Analysis of protein sequence embeddings generated by unsupervised language models.
  • Application of geometric and topological methods to visualize and quantify evolutionary relationships.
  • Comparison of embedding-based evolutionary insights with traditional phylogenetic approaches.
  • Main Results:

    • Protein embeddings reveal rich evolutionary signals, including homology, divergence, and convergence.
    • Geometric analysis of embeddings provides new perspectives on evolutionary trajectories.
    • Embeddings capture complex evolutionary patterns not readily apparent through conventional methods.

    Conclusions:

    • Protein language model embeddings are a powerful resource for evolutionary analysis.
    • A synthesis of evolutionary embedding analysis and molecular evolution is proposed.
    • This approach offers novel tools for understanding protein evolution and diversification.