Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Long-patch Base Excision Repair01:02

Long-patch Base Excision Repair

7.8K
Since the discovery of the two BER pathways, there has been a debate about how a cell chooses one pathway over the other and the factors determining this selection. Numerous in vitro experiments have pointed out multiple determinants for the sub-pathway selection. These are:
7.8K
RNA Editing02:23

RNA Editing

9.7K
RNA editing is a post-transcriptional modification where a precursor mRNA (pre-mRNA) nucleotide sequence is changed by base insertion, deletion, or modification. The extent of RNA editing varies from a few hundred bases, in mitochondrial DNA of trypanosomes, to a just single base, in nuclear genes of mammals. Even a single base change in the pre-mRNA can convert a codon for one amino acid into the codon for another amino acid or a stop codon. This type of re-coding can significantly affect the...
9.7K
RNA Interference01:23

RNA Interference

27.7K
RNA interference (RNAi) is a process in which a small non-coding RNA molecule blocks the post-transcriptional expression of a gene by binding to its messenger RNA (mRNA) and preventing the protein from being translated.
This process occurs naturally in cells, often through the activity of genomically-encoded microRNAs. Researchers can take advantage of this mechanism by introducing synthetic RNAs to deactivate specific genes for research or therapeutic purposes. For example, RNAi could be used...
27.7K
RNA-seq03:21

RNA-seq

11.7K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
11.7K
Bacterial RNA Polymerase00:43

Bacterial RNA Polymerase

32.4K
Unlike eukaryotes, bacteria use a single RNA Polymerase (RNAP) to transcribe all genes. The different subunits of bacterial RNAPhave distinct functions. The multisubunit structure of the bacterial RNAP helps the enzyme to maintain catalytic function, facilitate assembly, interact with DNA and RNA, and self-regulate its activity.
In most genes, the transcription site is a single base present upstream of the coding sequence. Though RNAP is a catalytically efficient enzyme, it does not recognize...
32.4K
Bacterial RNA Polymerase00:43

Bacterial RNA Polymerase

11.5K
11.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Localization of macromolecules in crowded cellular cryo-electron tomograms from extremely sparse labels.

Briefings in bioinformatics·2025
Same author

BioLLMNet: enhancing RNA-interaction prediction with a specialized cross-LLM transformation network.

Briefings in bioinformatics·2025
Same author

Leveraging Weighted Quartet Distributions for Enhanced Species Tree Inference from Genome-Wide Data.

Genome biology and evolution·2025
Same author

LOCAS: multilabel mRNA localization with supervised contrastive learning.

Briefings in bioinformatics·2025
Same author

Integrated screens identify AURKB dependency in advanced gastrointestinal stromal tumors.

The Journal of experimental medicine·2025
Same author

The TEX44-CPT1B axis regulates mitochondrial sheath assembly and fatty acid oxidation in sperm.

Nature communications·2025
Same journal

Ciliary flow and morphology shape mass transport at the surface and within gastrovascular cavities of black corals.

Communications biology·2026
Same journal

Virus-mediated prokaryotic community adaptation dynamics under thermal stress in municipal organic solid waste microbiomes.

Communications biology·2026
Same journal

Multi-omics insights into the woolly trait of Saussurea medusa and the plant's coordinated regulation of flavonoid biosynthesis.

Communications biology·2026
Same journal

Loss contexts enhance dorsolateral prefrontal interpersonal neural synchrony during successful human deceptive recommendations.

Communications biology·2026
Same journal

Neuro-regulator role of H<sub>2</sub>S in astrocyte activation and its effects on neurological damage and behavior of VPA-exposed rats.

Communications biology·2026
Same journal

Temporal orchestration of transcriptional and epigenomic programming underlying maternal embryonic diapause in a cricket model.

Communications biology·2026
See all related articles

Related Experiment Video

Updated: Jan 10, 2026

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists
05:22

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists

Published on: August 11, 2023

2.7K

BiRNA-BERT allows efficient RNA language modeling with adaptive tokenization.

Md Toki Tahmid1, Haz Sameen Shahgir2, Sazan Mahbub3

  • 1Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh.

Communications Biology
|November 20, 2025
PubMed
Summary
This summary is machine-generated.

BiRNA-BERT overcomes RNA sequence length limitations using adaptive dual-tokenization, enabling analysis of long RNA sequences without data loss. This Transformer model achieves state-of-the-art performance in RNA structure prediction and language modeling.

More Related Videos

Mapping RNA-RNA Interactions Globally Using Biotinylated Psoralen
11:32

Mapping RNA-RNA Interactions Globally Using Biotinylated Psoralen

Published on: May 24, 2017

12.6K

Related Experiment Videos

Last Updated: Jan 10, 2026

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists
05:22

Author Spotlight: Demonstrating Systematic Endobronchial Ultrasound to New Endoscopists

Published on: August 11, 2023

2.7K
Mapping RNA-RNA Interactions Globally Using Biotinylated Psoralen
11:32

Mapping RNA-RNA Interactions Globally Using Biotinylated Psoralen

Published on: May 24, 2017

12.6K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Genomics

Background:

  • Transformer models show promise in biological sequence analysis but struggle with RNA's long sequences.
  • Current RNA language models often truncate sequences, losing critical distal nucleotide context.
  • Standard NLP tokenization methods are ill-suited for nucleotide-level RNA analysis.

Purpose of the Study:

  • To develop a Transformer model, BiRNA-BERT, capable of processing arbitrarily long RNA sequences.
  • To introduce an adaptive dual-tokenization framework for RNA sequence modeling.
  • To achieve state-of-the-art performance in RNA-related tasks, including structural prediction.

Main Methods:

  • Developed BiRNA-BERT, a 117M-parameter Transformer encoder trained on 36 million non-coding RNA sequences.
  • Implemented an adaptive dual-tokenization framework combining nucleotide-level (NUC) and byte-pair encoding (BPE).
  • Dynamically selected tokenization strategy based on input sequence length to avoid truncation.

Main Results:

  • BiRNA-BERT processed arbitrarily long RNA sequences without truncation, outperforming existing models.
  • Achieved state-of-the-art results in tasks from short-sequence classification to long-context modeling and RNA structural prediction.
  • Demonstrated strong intrinsic language modeling performance (perplexity, token recovery) with a compact model size.

Conclusions:

  • BiRNA-BERT effectively addresses RNA sequence length constraints in Transformer models.
  • The adaptive dual-tokenization framework enhances RNA sequence modeling capabilities.
  • BiRNA-BERT offers a powerful and efficient solution for diverse RNA analysis tasks.