Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Leaky Scanning02:28

Leaky Scanning

5.1K
During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA.  Marilyn Kozak discovered that the sequence RCCAUGG (where R...
5.1K
Initiation of Translation02:33

Initiation of Translation

32.4K
Initiating translation is complex because it involves multiple molecules. Initiator tRNA, ribosomal subunits, and eukaryotic initiation factors (eIFs) are all required to assemble on the initiation codon of mRNA. This process consists of several steps that are mediated by different eIFs.
First, the initiator tRNA must be selected from the pool of elongator tRNAs by eukaryotic initiation factor 2 (eIF2). The initiator tRNA (Met-tRNAi) has conserved sequence elements including modified bases at...
32.4K
From DNA to Protein03:06

From DNA to Protein

18.3K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
18.3K
Improving Translational Accuracy02:07

Improving Translational Accuracy

10.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
10.0K
Ribosome Profiling02:24

Ribosome Profiling

3.5K
Ribosome profiling or ribo-sequencing is a deep sequencing technique that produces a snapshot of active translation in a cell. It selectively sequences the mRNAs protected by ribosomes to get an insight into a cell’s translation landscape at any given point in time.
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique...
3.5K
The Central Dogma01:25

The Central Dogma

125.1K
Overview
125.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Optimizing Bispecific Antibody Expression via Multi-Omics Analysis and Vector Redesign.

Biotechnology and bioengineering·2026
Same author

Decoding the language of messenger RNA.

Nature methods·2026
Same author

SenSet defines cell-type specific senescence signatures in the aged human lung.

The EMBO journal·2026
Same author

Identification of computationally designed skeletal and cardiac promoters with high specificity across AAV delivery routes.

BMC biotechnology·2026
Same author

MolQuery: Prediction of Lipid Synthesizability Using Active Learning.

ACS omega·2026
Same author

Single-cell atlas of human lung aging identifies cell type dyssynchrony and increased transcriptional entropy.

Nature communications·2026
Same journal

A unified analysis of cell type- and trajectory-associated pathways in single-cell data using Phoenix.

Genome research·2026
Same journal

Resf1 is required for proper placental development and configuration of trophoblast cell-specific heterochromatin.

Genome research·2026
Same journal

Telomere-driven replicative crisis is driven by large-scale changes in genomic architecture.

Genome research·2026
Same journal

Spatially informed reference-free cell-type deconvolution for spatial transcriptomics with SpatialCD.

Genome research·2026
Same journal

Spatially resolved profiling of steroid nuclear receptors reveals a role for the disordered N-terminal domains in genome targeting and AP-1 interaction.

Genome research·2026
Same journal

Flexible and scalable inference of spatially varying correlation in spatial transcriptomics with spCorr.

Genome research·2026
See all related articles

Related Experiment Video

Updated: Jun 22, 2025

Synthesis and Characterization of mRNA-Loaded PolyBeta Aminoesters Nanoparticles for Vaccination Purposes
08:27

Synthesis and Characterization of mRNA-Loaded PolyBeta Aminoesters Nanoparticles for Vaccination Purposes

Published on: August 13, 2021

4.5K

CodonBERT large language model for mRNA vaccines.

Sizhen Li1, Saeed Moayedpour1, Ruijiang Li1

  • 1Digital R&D, Sanofi, Cambridge, Massachusetts 02141, USA.

Genome Research
|July 1, 2024
PubMed
Summary
This summary is machine-generated.

We developed CodonBERT, a novel large language model (LLM) for messenger RNA (mRNA) sequence optimization. CodonBERT improves mRNA design for better expression, stability, and immunogenicity, outperforming existing methods.

More Related Videos

Measurement of Specific Mycobacterial Mistranslation Rates with Gain-of-function Reporter Systems
06:18

Measurement of Specific Mycobacterial Mistranslation Rates with Gain-of-function Reporter Systems

Published on: April 26, 2019

6.0K
Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses
12:20

Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses

Published on: December 29, 2015

21.4K

Related Experiment Videos

Last Updated: Jun 22, 2025

Synthesis and Characterization of mRNA-Loaded PolyBeta Aminoesters Nanoparticles for Vaccination Purposes
08:27

Synthesis and Characterization of mRNA-Loaded PolyBeta Aminoesters Nanoparticles for Vaccination Purposes

Published on: August 13, 2021

4.5K
Measurement of Specific Mycobacterial Mistranslation Rates with Gain-of-function Reporter Systems
06:18

Measurement of Specific Mycobacterial Mistranslation Rates with Gain-of-function Reporter Systems

Published on: April 26, 2019

6.0K
Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses
12:20

Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses

Published on: December 29, 2015

21.4K

Area of Science:

  • Biotechnology
  • Computational Biology
  • Genomics

Background:

  • Messenger RNA (mRNA) technology is increasingly vital for vaccines and therapeutics.
  • Optimizing mRNA sequences is crucial for enhancing expression, stability, and immunogenicity.
  • A vast number of mRNA sequences can encode even small proteins, necessitating efficient design tools.

Purpose of the Study:

  • To develop an advanced large language model (LLM) for optimizing mRNA sequences.
  • To create a tool that captures complex biological concepts within mRNA sequences.
  • To improve the prediction of mRNA properties for therapeutic and vaccine development.

Main Methods:

  • Developed CodonBERT, an LLM that utilizes codons as input for enhanced representation learning.
  • Trained CodonBERT on over 10 million diverse mRNA sequences from various organisms.
  • Evaluated CodonBERT's performance on predicting mRNA properties and compared it to existing methods.

Main Results:

  • CodonBERT effectively learns representations from mRNA sequences using codons as input.
  • The model demonstrates the ability to capture significant biological insights from mRNA data.
  • CodonBERT achieved superior performance in mRNA property prediction compared to previous methods, including on a novel influenza vaccine dataset.

Conclusions:

  • CodonBERT represents a significant advancement in mRNA sequence design and optimization.
  • The LLM provides a powerful tool for enhancing the development of mRNA-based vaccines and therapeutics.
  • Further applications of CodonBERT can extend to predicting diverse mRNA-related biological properties.