Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

3.9K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
3.9K
Improving Translational Accuracy02:07

Improving Translational Accuracy

2.5K
2.5K
From DNA to Protein03:06

From DNA to Protein

17.8K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
17.8K
Cis-regulatory Sequences02:02

Cis-regulatory Sequences

9.6K
Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...
9.6K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

93
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
93
Neural Circuits01:25

Neural Circuits

945
Neural circuits and neuronal pools are two of the main structures found in the nervous system. Neural circuits are networks of neurons that work together to carry out a specific task or process. They consist of interconnected neurons and glial cells, which provide structural and metabolic support.
Neuronal pools are collections of nerve cells with similar functions and interact through chemical and electrical signals. These pools include both interneurons (the central neural circuit nodes that...
945

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A synthetic cell phage cycle.

Nature communications·2025
Same author

Building a Synthetic Cell Together.

Nature communications·2025
Same author

qByte: An open-source isothermal fluorimeter for democratizing analysis of nucleic acids, proteins and cells.

PLoS biology·2025
Same author

Phage-mediated intercellular CRISPRi for biocomputation in bacterial consortia.

Nucleic acids research·2024
Same author

PHEIGES: all-cell-free phage synthesis and selection from engineered genomes.

Nature communications·2024
Same author

Cell-free biosynthesis combined with deep learning accelerates de novo-development of antimicrobial peptides.

Nature communications·2023
Same journal

Kat5 deficiency in alveolar type II cells licenses STAT6-driven glycolytic reprogramming and pulmonary fibrosis.

Nature communications·2026
Same journal

Continuous nonthermal slab gap formed by progressive tearing beneath Northeast Asia.

Nature communications·2026
Same journal

Zeolitic isolated protonic acid sites-mediated NH<sub>3</sub> storage for robust NO<sub>x</sub> removal.

Nature communications·2026
Same journal

Coaxially nested component with asymmetric fiber resonant cavity and separation membrane for gaseous and dissolved gases detection.

Nature communications·2026
Same journal

Near-unity charge readout signal in a nonlinear resonator without matching the sensor dissipation.

Nature communications·2026
Same journal

Prokaryotic Schlafen proteins cleave tRNAs during type III CRISPR immunity.

Nature communications·2026
See all related articles

Related Experiment Video

Updated: May 17, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

922

CodonTransformer: a multispecies codon optimizer using context-aware neural networks.

Adibvafa Fallahpour1,2, Vincent Gureghian3,4, Guillaume J Filion5

  • 1Vector Institute for Artificial Intelligence, Toronto, ON, Canada.

Nature Communications
|April 3, 2025
PubMed
Summary
This summary is machine-generated.

CodonTransformer, a deep learning model, optimizes DNA sequences for specific organisms by learning from natural sequences. This tool generates natural-like codon usage and minimizes negative elements for improved gene expression.

More Related Videos

Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers
10:41

Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers

Published on: June 24, 2019

7.5K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K

Related Experiment Videos

Last Updated: May 17, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

922
Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers
10:41

Identifying Amino Acid Overproducers Using Rare-Codon-Rich Markers

Published on: June 24, 2019

7.5K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K

Area of Science:

  • Computational Biology
  • Genomics
  • Machine Learning

Background:

  • Genetic code degeneracy allows multiple DNA sequences for one protein.
  • Optimizing codon usage is computationally complex due to combinatorial explosion.
  • Natural sequences offer insights into evolutionary codon usage rules.

Purpose of the Study:

  • To develop a deep learning model for optimizing DNA sequences across multiple species.
  • To generate host-specific DNA sequences with natural codon distribution and minimal regulatory elements.

Main Methods:

  • Developed CodonTransformer, a multispecies deep learning model using a Transformer architecture.
  • Trained the model on over 1 million DNA-protein pairs from 164 organisms.
  • Implemented a sequence representation strategy combining organism, amino acid, and codon encodings.
  • Introduced the Shared Token Representation and Encoding with Aligned Multi-masking (STREAM) strategy.

Main Results:

  • CodonTransformer demonstrates context-aware sequence generation.
  • Generated DNA sequences exhibit natural-like codon distribution profiles.
  • Sequences generated have minimal negative cis-regulatory elements.
  • The model is adaptable for host-specific codon optimization.

Conclusions:

  • CodonTransformer provides an effective framework for codon optimization.
  • The open-access model and Colab interface facilitate broader application.
  • This approach leverages machine learning to address challenges in synthetic biology and genetic engineering.