Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

DNA as a Genetic Template02:05

DNA as a Genetic Template

21.9K
Two structural features of the DNA molecule provide a basis for the mechanisms of heredity: the four nucleotide bases and its double-stranded nature. The Watson-Crick model of double-helical DNA structure, proposed in 1952, drew heavily upon the X-ray crystallography work of researchers Rosalind Franklin and Maurice Wilkins. Watson, Crick, and Wilkins jointly received the Nobel Prize in Physiology or Medicine for their work in 1962. Franklin was, controversially, excluded from the prize for...
21.9K
DNA Base Pairing02:27

DNA Base Pairing

27.4K
Erwin Chargaff’s rules on DNA equivalence paved the way for the discovery of base pairing in DNA. Chargaff’s rules state that in a double-stranded DNA molecule,
27.4K
Encoding01:19

Encoding

168
Information enters the brain through encoding, which is the input of information into the memory system. Once sensory information is received from the environment, the brain labels or codes it. The information is then organized with similar information and connected to existing concepts. Encoding occurs through automatic processing and effortful processing.
Automatic processing involves the encoding of details like time, space, frequency, and the meaning of words, usually done without conscious...
168
Nucleic Acid Structure01:25

Nucleic Acid Structure

6.1K
The pentose sugar in DNA is deoxyribose, while in RNA the pentose sugar is ribose. The difference between the sugars is the presence of the hydroxyl group on the ribose's second carbon and a hydrogen on the deoxyribose's second carbon. The phosphate residue attaches to the hydroxyl group of the 5′ carbon of one sugar and the hydroxyl group of the 3′ carbon of the sugar of the next nucleotide, which forms  a 5′ to 3′ phosphodiester linkage.
DNA Structure
DNA...
6.1K
From DNA to Protein03:06

From DNA to Protein

18.4K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
18.4K
Base-pairing and DNA Repair02:27

Base-pairing and DNA Repair

64.7K
64.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Direct pharmacological targeting of asparagine synthetase to overcome resistance to L-asparaginase in ALL therapy.

JCI insight·2026
Same author

Evaluation of season and sex specific differential gonadal transcriptomics in Himalayan snow trout Schizothorax plagiostomus, Heckel, 1838.

Scientific reports·2026
Same author

TURBO-RL: turbulence mitigation using reinforcement learning for severe optical aberrations.

Journal of the Optical Society of America. A, Optics, image science, and vision·2026
Same author

Canonical microRNA loss drives tumor development, implicating therapeutic efficacy of enoxacin in angiosarcoma.

RNA (New York, N.Y.)·2026
Same author

Exploring tumor dynamics and responses of prostate cancer to IL-27 based treatment combinations through biodynamic imaging and RNA sequencing analyses.

Scientific reports·2025
Same author

The nonsteroidal anti-inflammatory drug sulindac reverses obesity-driven immunosuppression and triple-negative breast cancer progression.

Breast cancer research : BCR·2025
Same journal

Poisoning the Genome: Targeted Backdoor Attacks on DNA Foundation Models.

ArXiv·2026
Same journal

Mechanistic mathematical model of the in vitro infection dynamics of Bunyamwera and Batai viruses including MOI-dependent shortening of the eclipse phase.

ArXiv·2026
Same journal

AI-Driven Lumped-Element Modeling of Human Respiratory System for Studying Voice Mechanics.

ArXiv·2026
Same journal

Beyond Algorithms: Conceptual Innovation in Medical Imaging AI.

ArXiv·2026
Same journal

Feynman Kac Reweighted Schrödinger Bridge Matching for Surface-Based Tau PET Harmonization.

ArXiv·2026
Same journal

Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3.

ArXiv·2026
See all related articles

Related Experiment Video

Updated: Jul 2, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.0K

Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision.

Aditya Malusare1,2, Harish Kothandaraman2, Dipesh Tamboli3

  • 1School of Industrial Engineering, Purdue University, USA.

Arxiv
|February 27, 2024
PubMed
Summary
This summary is machine-generated.

The Ensemble Nucleotide Byte-level Encoder-Decoder (ENBED) model analyzes DNA at byte-level precision. This foundation model achieves state-of-the-art results in genomic sequence analysis and mutation generation.

Keywords:
AttentionDNAGene AnnotationMutationsTransformers

More Related Videos

DNA-Tethered RNA Polymerase for Programmable In vitro Transcription and Molecular Computation
09:26

DNA-Tethered RNA Polymerase for Programmable In vitro Transcription and Molecular Computation

Published on: December 29, 2021

4.2K
Analyzing and Building Nucleic Acid Structures with 3DNA
16:24

Analyzing and Building Nucleic Acid Structures with 3DNA

Published on: April 26, 2013

20.6K

Related Experiment Videos

Last Updated: Jul 2, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.0K
DNA-Tethered RNA Polymerase for Programmable In vitro Transcription and Molecular Computation
09:26

DNA-Tethered RNA Polymerase for Programmable In vitro Transcription and Molecular Computation

Published on: December 29, 2021

4.2K
Analyzing and Building Nucleic Acid Structures with 3DNA
16:24

Analyzing and Building Nucleic Acid Structures with 3DNA

Published on: April 26, 2013

20.6K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Genomic sequence analysis traditionally relies on tokenization methods.
  • Existing foundation models often use encoder-only or decoder-only architectures.
  • Byte-level precision is crucial for accurate analysis of DNA sequences.

Purpose of the Study:

  • Introduce the Ensemble Nucleotide Byte-level Encoder-Decoder (ENBED) foundation model.
  • Develop an efficient, generalizable sequence-to-sequence model for genomic tasks.
  • Demonstrate ENBED's superiority over existing state-of-the-art methods.

Main Methods:

  • Utilized an encoder-decoder Transformer architecture with sub-quadratic attention.
  • Pre-trained the foundation model using Masked Language Modeling on reference genome sequences.
  • Applied the model to downstream tasks including sequence annotation and mutation generation.

Main Results:

  • Achieved significant improvements in identifying enhancers, promoters, and splice sites.
  • Successfully recognized sequences with base call errors and indels at byte-level precision.
  • Demonstrated state-of-the-art performance in biological function annotation and viral mutation generation.

Conclusions:

  • ENBED offers a powerful and efficient approach for byte-level genomic sequence analysis.
  • The encoder-decoder architecture generalizes previous genomic models.
  • ENBED significantly advances the state-of-the-art in multiple critical bioinformatics tasks.