Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Viral Mutations00:36

Viral Mutations

32.7K
A mutation is a change in the sequence of bases of DNA or RNA in a genome. Some mutations occur during replication of the genome due to errors made by the polymerase enzymes that replicate DNA or RNA. Unlike DNA polymerase, RNA polymerase is prone to errors because it is not capable of “proofreading” its work. Viruses with RNA-based genomes, like HIV, therefore accrue mutations faster than viruses with DNA-based genomes. Because mutation and recombination provide the raw material...
32.7K
Leaky Scanning02:28

Leaky Scanning

5.2K
During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA.  Marilyn Kozak discovered that the sequence RCCAUGG (where R...
5.2K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

2.5K
2.5K
Size and Structure of Viral Genomes01:26

Size and Structure of Viral Genomes

101
Viral genomes exhibit remarkable diversity in size, structure, and composition, influencing their replication strategies and interactions with host cells. These genomes consist of either DNA or RNA and may be linear or circular. Additionally, they can be single-stranded or double-stranded, with each configuration affecting how the virus propagates within a host. RNA viruses, for instance, generally have smaller genomes than DNA viruses, a factor that contributes to their high mutation rates and...
101
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.3K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.3K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.1K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Pharmacological profiling of intravenous MP-04: sustained NAD<sup>+</sup> augmentation, immune modulation, and renal protection in preclinical models.

Frontiers in pharmacology·2026
Same author

Comprehensive Metabolomic Analysis of Saliva Using SWATH-DIA Reveals Systemic Metabolic Adaptations to Exercise.

ACS omega·2026
Same author

Dose-Dependent Effects of Dihydronicotinamide Riboside on Human Engineered Skeletal Muscle Development.

ACS biomaterials science & engineering·2026
Same author

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models.

Advances in neural information processing systems·2026
Same author

Mechanistic insights into the association and activation of the SARS-CoV-2 2'-O-Methyltransferase (NSP16).

bioRxiv : the preprint server for biology·2026
Same author

Computational microbiology: Where is artificial intelligence addressing the barriers to large-scale simulations of bacterial cell envelopes?

Current opinion in structural biology·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Aug 19, 2025

Production of a SARS-CoV-2 Virus-Like-Particle System to Investigate Viral Life Cycles In Vitro
09:26

Production of a SARS-CoV-2 Virus-Like-Particle System to Investigate Viral Life Cycles In Vitro

Published on: June 6, 2025

567

GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics.

Maxim Zvyagin1, Alexander Brace1,2, Kyle Hippe1

  • 1Argonne National Laboratory.

Biorxiv : the Preprint Server for Biology
|December 1, 2022
PubMed
Summary
This summary is machine-generated.

Genome-scale language models (GenSLMs) rapidly identify SARS-CoV-2 variants of concern by learning viral evolution. These foundation models generalize to other prediction tasks, transforming pandemic response.

Keywords:
AICOVID-19HPCLarge language modelsSARS-CoV-2whole genome analyses

More Related Videos

Following the Dynamics of Structural Variants in Experimentally Evolved Populations
04:52

Following the Dynamics of Structural Variants in Experimentally Evolved Populations

Published on: February 3, 2023

1.0K
Modeling The Lifecycle Of Ebola Virus Under Biosafety Level 2 Conditions With Virus-like Particles Containing Tetracistronic Minigenomes
10:11

Modeling The Lifecycle Of Ebola Virus Under Biosafety Level 2 Conditions With Virus-like Particles Containing Tetracistronic Minigenomes

Published on: September 27, 2014

36.4K

Related Experiment Videos

Last Updated: Aug 19, 2025

Production of a SARS-CoV-2 Virus-Like-Particle System to Investigate Viral Life Cycles In Vitro
09:26

Production of a SARS-CoV-2 Virus-Like-Particle System to Investigate Viral Life Cycles In Vitro

Published on: June 6, 2025

567
Following the Dynamics of Structural Variants in Experimentally Evolved Populations
04:52

Following the Dynamics of Structural Variants in Experimentally Evolved Populations

Published on: February 3, 2023

1.0K
Modeling The Lifecycle Of Ebola Virus Under Biosafety Level 2 Conditions With Virus-like Particles Containing Tetracistronic Minigenomes
10:11

Modeling The Lifecycle Of Ebola Virus Under Biosafety Level 2 Conditions With Virus-like Particles Containing Tetracistronic Minigenomes

Published on: September 27, 2014

36.4K

Area of Science:

  • Genomics
  • Virology
  • Artificial Intelligence

Background:

  • Accurate and rapid identification of novel viral variants, such as SARS-CoV-2, is crucial for pandemic control.
  • Existing methods for variant classification may not fully capture the complex evolutionary dynamics of rapidly mutating viruses.

Approach:

  • Adaptation of large language models (LLMs) for genomic data to create genome-scale language models (GenSLMs).
  • Pre-training GenSLMs on over 110 million prokaryotic gene sequences and fine-tuning on 1.5 million SARS-CoV-2 genomes.
  • Demonstration of GenSLM scalability on advanced computing infrastructure, including GPU supercomputers and AI accelerators, achieving high computational performance.

Key Points:

  • GenSLMs accurately and rapidly identify SARS-CoV-2 variants of concern.
  • These models represent early examples of whole-genome foundation models applicable to diverse prediction tasks.
  • The study showcases significant computational scaling, utilizing Zettaflops and PFLOPS for training.

Conclusions:

  • GenSLMs offer a novel approach to understanding viral evolution and variant identification.
  • The developed models provide initial scientific insights into tracking SARS-CoV-2 evolutionary dynamics.
  • This work paves the way for applying large-scale biological data analysis to future pandemic preparedness.