Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Nucleic Acid Structure

Nucleic Acid Structure

The pentose sugar in DNA is deoxyribose, while in RNA the pentose sugar is ribose. The difference between the sugars is the presence of the hydroxyl group on the ribose's second carbon and a hydrogen on the deoxyribose's second carbon. The phosphate residue attaches to the hydroxyl group of the 5′ carbon of one sugar and the hydroxyl group of the 3′ carbon of the sugar of the next nucleotide, which forms a 5′ to 3′ phosphodiester linkage.
DNA Structure
DNA...

RNA Stability

RNA Stability

Intact DNA strands can be found in fossils, while scientists sometimes struggle to keep RNA intact under laboratory conditions. The structural variations between RNA and DNA underlie the differences in their stability and longevity. Because DNA is double-stranded, it is inherently more stable. The single-stranded structure of RNA is less stable but also more flexible and can form weak internal bonds. Additionally, most RNAs in the cell are relatively short, while DNA can be up to 250 million...

Nucleic Acids

Nucleic Acids

Nucleic acids are the most important macromolecules for the continuity of life. They carry the cell's genetic blueprint and carry instructions for its functioning.
DNA and RNA
The two main types of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). DNA is the genetic material in all living organisms, ranging from single-celled bacteria to multicellular mammals. It is in the nucleus of eukaryotes and in the organelles, chloroplasts, and mitochondria. In prokaryotes,...

Bacterial RNA Polymerase

Bacterial RNA Polymerase

Unlike eukaryotes, bacteria use a single RNA Polymerase (RNAP) to transcribe all genes. The different subunits of bacterial RNAPhave distinct functions. The multisubunit structure of the bacterial RNAP helps the enzyme to maintain catalytic function, facilitate assembly, interact with DNA and RNA, and self-regulate its activity.
In most genes, the transcription site is a single base present upstream of the coding sequence. Though RNAP is a catalytically efficient enzyme, it does not recognize...

Nucleic acids

Nucleic acids

Nucleic acids are the most important macromolecules for the continuity of life. They carry the cell's genetic blueprint and carry instructions for its functioning.
DNA and RNA
The two main types of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). DNA is the genetic material in all living organisms, ranging from single-celled bacteria to multicellular mammals. It is in the nucleus of eukaryotes and in the organelles, chloroplasts, and mitochondria. In prokaryotes,...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

GnnDebugger: GNN based error correction in De Bruijn Graphs.

BMC bioinformatics·2026

Same author

DUSP1 is a Key Driver of Disease Persistence and Potential Therapeutic Target in Hairy Cell Leukemia.

Blood advances·2026

Same author

A complete human pancreatic cancer genome.

bioRxiv : the preprint server for biology·2026

Same author

Evaluating Molecular Docking Programs for RNA-Targeted Ligand Screening: Influence of Binding Modes and Ligand Types.

Journal of chemical information and modeling·2026

Same author

Telomere-to-telomere assembly using HERRO-corrected Nanopore Simplex reads.

Nature·2026

Same author

Direct RNA sequencing and signal alignment reveal RNA structure ensembles in a eukaryotic cell.

Nature methods·2026

Same journal

Demonstration of a quantum C-NOT gate in a time-multiplexed fully reconfigurable photonic processor.

Nature communications·2026

Same journal

Nonlinear quantum light source with van der Waals ferroelectric NbOX<sub>2</sub> (X = Br, I).

Nature communications·2026

Same journal

Antagonistic histone H2A variants and autonomous heterochromatin formation shape epigenomic patterns in Arabidopsis.

Nature communications·2026

Same journal

The long tail of nitrate pollution in groundwater challenges governance of global water quality.

Nature communications·2026

Same journal

Select microbial metabolites promote tau aggregation in a murine tauopathy model.

Nature communications·2026

Same journal

Warming climate has lengthened global intense tropical cyclone seasons.

Nature communications·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 17, 2025

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

RiNALMo: general-purpose RNA language models can generalize well on structure prediction tasks.

Rafael Josip Penić¹, Tin Vlašić², Roland G Huber³

¹Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia.

Nature Communications

|July 2, 2025

Summary

This summary is machine-generated.

Researchers developed the largest RNA language model, RiNALMo, to decode RNA sequences. This advanced model extracts hidden knowledge and predicts RNA structures, outperforming existing methods on unseen RNA families.

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

RNA Secondary Structure Prediction Using High-throughput SHAPE

RNA Secondary Structure Prediction Using High-throughput SHAPE

Published on: May 31, 2013

Related Experiment Videos

Last Updated: Sep 17, 2025

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

RNA Secondary Structure Prediction Using High-throughput SHAPE

RNA Secondary Structure Prediction Using High-throughput SHAPE

Published on: May 31, 2013

Area of Science:

Bioinformatics
Computational Biology
Molecular Biology

Background:

Ribonucleic acid (RNA) is emerging as a key target for small-molecule drugs, necessitating a deeper understanding of its structure and function.
Vast amounts of unlabeled RNA sequence data generated by sequencing technologies hold significant untapped potential for biological insights.
Existing deep learning methods struggle with generalizing RNA secondary structure predictions to novel RNA families.

Purpose of the Study:

To introduce the RiboNucleic Acid Language Model (RiNALMo), the largest RNA language model developed to date.
To leverage advances in protein language models for analyzing RNA sequences.
To extract implicit structural information and hidden knowledge from large RNA datasets.

Main Methods:

Pre-training RiNALMo, a 650M parameter model, on a dataset of 36 million non-coding RNA sequences from diverse databases.
Utilizing a transformer-based architecture, similar to successful protein language models.
Evaluating RiNALMo's performance on various downstream tasks, including secondary structure prediction.

Main Results:

RiNALMo achieved state-of-the-art performance across multiple RNA-related downstream tasks.
Demonstrated superior generalization capabilities compared to existing deep learning models, particularly for predicting secondary structures of unseen RNA families.
Successfully captured implicit structural information embedded within RNA sequences.

Conclusions:

RiNALMo represents a significant advancement in analyzing RNA sequences and understanding their functions.
The model's ability to generalize to new RNA families addresses a critical limitation in current deep learning approaches for RNA structure prediction.
RiNALMo unlocks the potential of large unlabeled RNA datasets for drug discovery and biological research.