Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genomics02:02

Genomics

36.2K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
36.2K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

18.8K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
18.8K
Synthetic Biology02:55

Synthetic Biology

4.7K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
4.7K
What is Genetic Engineering?00:49

What is Genetic Engineering?

74.0K
Overview
74.0K
Ribosome Profiling02:24

Ribosome Profiling

3.5K
Ribosome profiling or ribo-sequencing is a deep sequencing technique that produces a snapshot of active translation in a cell. It selectively sequences the mRNAs protected by ribosomes to get an insight into a cell’s translation landscape at any given point in time.
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique...
3.5K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Replaying germinal center evolution on a quantified affinity landscape.

Cell·2026
Same author

Tree reconstruction guarantees from CRISPR-Cas9 lineage tracing data using Neighbor-Joining.

Genome research·2026
Same author

Hypervariable loop profiling decodes sequence determinants of antibody stability.

Nature structural & molecular biology·2026
Same author

Inference of germinal center evolutionary dynamics via simulation-based deep learning.

eLife·2026
Same author

Impact of device implantation depth on blood flow dynamics after left atrial appendage closure.

EuroIntervention : journal of EuroPCR in collaboration with the Working Group on Interventional Cardiology of the European Society of Cardiology·2026
Same author

Separating selection from mutation in antibody language models.

eLife·2026
Same journal

Poisoning the Genome: Targeted Backdoor Attacks on DNA Foundation Models.

ArXiv·2026
Same journal

Mechanistic mathematical model of the in vitro infection dynamics of Bunyamwera and Batai viruses including MOI-dependent shortening of the eclipse phase.

ArXiv·2026
Same journal

AI-Driven Lumped-Element Modeling of Human Respiratory System for Studying Voice Mechanics.

ArXiv·2026
Same journal

Beyond Algorithms: Conceptual Innovation in Medical Imaging AI.

ArXiv·2026
Same journal

Feynman Kac Reweighted Schrödinger Bridge Matching for Surface-Based Tau PET Harmonization.

ArXiv·2026
Same journal

Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3.

ArXiv·2026
See all related articles

Related Experiment Video

Updated: Jun 18, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

534

Genomic Language Models: Opportunities and Challenges.

Gonzalo Benegas1, Chengzhong Ye2, Carlos Albors1

  • 1Computer Science Division, University of California, Berkeley.

Arxiv
|July 29, 2024
PubMed
Summary
This summary is machine-generated.

Genomic Language Models (gLMs), a type of large language model (LLM) trained on DNA, offer powerful new ways to understand genome function. These models show promise for applications like predicting functional constraints and designing DNA sequences.

More Related Videos

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.2K
Transcriptomic Analysis of C. elegans RNA Sequencing Data Through the Tuxedo Suite on the Galaxy Project
10:19

Transcriptomic Analysis of C. elegans RNA Sequencing Data Through the Tuxedo Suite on the Galaxy Project

Published on: April 8, 2017

17.4K

Related Experiment Videos

Last Updated: Jun 18, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

534
Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.2K
Transcriptomic Analysis of C. elegans RNA Sequencing Data Through the Tuxedo Suite on the Galaxy Project
10:19

Transcriptomic Analysis of C. elegans RNA Sequencing Data Through the Tuxedo Suite on the Galaxy Project

Published on: April 8, 2017

17.4K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Large language models (LLMs) are increasingly impacting scientific research, especially in the biomedical field.
  • Understanding biological sequences, particularly DNA, is a key objective in modern biology.
  • Genomic Language Models (gLMs) leverage LLMs to analyze DNA sequences, aiming to decipher genomic functions.

Purpose of the Study:

  • To highlight the potential of genomic language models (gLMs) in advancing our understanding of genomes.
  • To showcase key applications of gLMs, including functional constraint prediction, sequence design, and transfer learning.
  • To discuss critical considerations for developing and evaluating gLMs, particularly for complex genomes.

Main Methods:

  • Training large language models (LLMs) on vast datasets of DNA sequences to create genomic language models (gLMs).
  • Applying gLMs to tasks such as predicting functional constraints within genomic sequences.
  • Utilizing gLMs for de novo sequence design and for transfer learning across different genomic datasets.

Main Results:

  • Demonstrated potential of gLMs in predicting functional constraints on DNA sequences.
  • Showcased gLMs' capabilities in designing novel DNA sequences with desired properties.
  • Highlighted the utility of gLMs in transfer learning for genomic data analysis.

Conclusions:

  • Genomic language models (gLMs) represent a significant advancement in analyzing and understanding genomic data.
  • Effective development and evaluation of gLMs are crucial, especially for large and complex genomes.
  • gLMs hold substantial promise for future discoveries in genomics and related biomedical fields.