Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Gene Duplication and Divergence02:37

Gene Duplication and Divergence

8.1K
The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are...
8.1K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

8.3K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
8.3K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

3.7K
3.7K
Gene Families01:57

Gene Families

10.1K
Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
10.1K
Genomics02:02

Genomics

41.1K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
41.1K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

3.5K
3.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Classifying Convergences in the Light of Horizontal Gene Transfer: Epaktovars and Xenotypes.

Molecular biology and evolution·2025
Same author

Contingency, repeatability, and predictability in the evolution of a prokaryotic pangenome.

Proceedings of the National Academy of Sciences of the United States of America·2023
Same author

Prokaryotic Pangenomes Act as Evolving Ecosystems.

Molecular biology and evolution·2022
Same author

Gene essentiality evolves across a pangenome.

Nature microbiology·2022
Same author

The role of public goods in planetary evolution.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2017
Same author

Function-related replacement of bacterial siderophore pathways.

The ISME journal·2017
Same journal

Phylogenomic blind spots: The limits of UCE and BUSCO loci in the presence of gene flow.

Molecular biology and evolution·2026
Same journal

seqLens: optimizing language models for genomic predictions.

Molecular biology and evolution·2026
Same journal

The transcriptional and translational outcomes for pseudogenes in bacterial endosymbionts.

Molecular biology and evolution·2026
Same journal

800 million years of co-evolution in the green plant lineage - the case of LEUNIG and SEUSS transcriptional co-regulators.

Molecular biology and evolution·2026
Same journal

RNA i-motif landscapes in plant kingdom and their potential functional roles.

Molecular biology and evolution·2026
Same journal

Functional Divergence and Structural Changes of class IV Histone Deacetylases (HDACs) Across the Tree of Life.

Molecular biology and evolution·2026
See all related articles

Related Experiment Video

Updated: Feb 26, 2026

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.6K

Genomic perplexity and the evolution of context-dependent function.

James O McInerney1

  • 1Department of Evolution, Ecology and Behaviour, Institute of Infection, Veterinary, and Ecological Sciences, Biosciences Building, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.

Molecular Biology and Evolution
|February 25, 2026
PubMed
Summary
This summary is machine-generated.

Genomes function like large language models, encoding context-dependent probabilities rather than fixed gene functions. Genomic perplexity quantifies gene incompatibility, explaining fitness costs in gene flow.

Keywords:
genomic perplexityhorizontal gene transferinformation theorymachine learningmutation

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

4.0K
In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
06:41

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

14.4K

Related Experiment Videos

Last Updated: Feb 26, 2026

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.6K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

4.0K
In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
06:41

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

14.4K

Area of Science:

  • Genomics
  • Evolutionary Biology
  • Computational Biology

Background:

  • Traditional genetics assumes fixed gene functions, but pangenomics and GWAS show context-dependent organismal functions.
  • Genomic context significantly influences gene and organismal phenotypes, even for core genes.

Purpose of the Study:

  • To propose a novel framework viewing genomes as probability distributions over functional outcomes, analogous to large language models (LLMs).
  • To introduce and define 'genomic perplexity' as an information-theoretic measure of genetic element incompatibility within a genomic context.
  • To provide a testable framework for predicting gene integration potential and understanding evolutionary processes.

Main Methods:

  • Conceptual analogy between genomic epistasis and LLM attention mechanisms.
  • Introduction of information-theoretic concept 'genomic perplexity'.
  • Demonstration of perplexity as a metric for fitness costs in horizontal gene transfer (HGT) and introgression.

Main Results:

  • Genomes, like LLMs, encode probability distributions of function, not fixed functions.
  • Genomic epistasis is analogous to attention mechanisms, where context weights genetic element influence.
  • Genomic perplexity quantifies the incompatibility of genetic elements, explaining fitness costs in gene flow.

Conclusions:

  • A probabilistic, LLM-inspired framework reframes our understanding of gene function and genomic interactions.
  • Genomic perplexity offers a quantifiable metric for gene-environment interactions and evolutionary integration.
  • This perspective advances synthetic biology, evolutionary modeling, and our understanding of genomic adaptation.