Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Conserved Binding Sites

Conserved Binding Sites

Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...

Conserved Binding Sites

Conserved Binding Sites

Ligand Binding Sites

Ligand Binding Sites

Proteins are dynamic macromolecules that carry out a wide variety of essential processes; however, the activities of most proteins depend on their interactions with other molecules or ions, known as ligands.
Protein-ligand interactions are quite specific; even though numerous potential ligands surround a cellular protein at any given time, only a particular ligand can bind to that protein. Moreover, a ligand binds only to a dedicated area on the surface of the protein, known as the...

Ligand Binding Sites

Ligand Binding Sites

Leaky Scanning

Leaky Scanning

During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA. Marilyn Kozak discovered that the sequence RCCAUGG (where R...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

STE-DC2I Uncovers Driver Genes in Colorectal Cancer Subtypes Using Symbolic Trajectory-Embedded Dark Causal Inference.

Journal of chemical information and modeling·2026

Same author

LLM-Enhanced Knowledge Distillation for Sequence-Based Protein-Ligand Interaction Prediction.

IEEE journal of biomedical and health informatics·2026

Same author

Robust graph structure learning to improve multi-omics cancer subtype classification.

BMC bioinformatics·2026

Same author

Draft genome sequences of three <i>Streptomyces</i> sp. strains isolated from marine sponges in Kochi Prefecture, Japan.

Microbiology resource announcements·2026

Same author

Revealing intra-group immunotherapy response heterogeneity in metastatic urothelial carcinoma through interpretable feature extraction and spectral clustering.

Frontiers in immunology·2026

Same author

Correction: Anomaly detection in double-entry bookkeeping data by federated learning system with non-model sharing approach.

Scientific reports·2026

Same journal

AdaWGAN: Data Augmentation for Few-Shot HD-sEMG Gesture Recognition Using Single-Trial Data.

IEEE journal of biomedical and health informatics·2026

Same journal

NeuroBooster: a domain-informed self-supervised learning paradigm tailored for brain MRI analysis.

IEEE journal of biomedical and health informatics·2026

Same journal

Graph Convolutional Neural Network based Depression Detection using Brain Functional Connectivity Measures.

IEEE journal of biomedical and health informatics·2026

Same journal

Improving Multi-Sensor Non-Invasive Glucose Detection through AI: A Domain Generalization Approach.

IEEE journal of biomedical and health informatics·2026

Same journal

Unmixing the Neck: Accurate Jugular Venous Pulse Detection From Wearable PPG.

IEEE journal of biomedical and health informatics·2026

Same journal

AD-DAE: Alzheimer's Disease Progression Modeling with Unpaired Longitudinal MRI using Diffusion Auto-Encoders.

IEEE journal of biomedical and health informatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 29, 2026

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

A Dual-Language-Model Framework for Reproducibility in Small Molecule-RNA Binding Site Prediction.

Shixuan Guan, Xiucai Ye, Tetsuya Sakurai

IEEE Journal of Biomedical and Health Informatics

|March 27, 2026

Summary

This summary is machine-generated.

Single-seed evaluations in molecular learning can overestimate performance. Reproducibility analysis for RNA-ligand binding site prediction shows multi-seed runs are crucial for accurate performance metrics.

More Related Videos

Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins

Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins

Published on: August 9, 2019

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

Published on: February 23, 2024

Related Experiment Videos

Last Updated: Mar 29, 2026

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins

Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins

Published on: August 9, 2019

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

Published on: February 23, 2024

Area of Science:

Computational biology
Machine learning in drug discovery
Bioinformatics

Background:

Single-seed evaluation is common in small-dataset molecular learning but can inflate performance estimates.
Reproducibility in RNA-ligand binding site prediction using large pretrained RNA language models is underexplored.

Purpose of the Study:

To conduct the first systematic reproducibility analysis for RNA-ligand binding site prediction.
To integrate two large pretrained RNA language models (RNA-FM and RiNALMo) across multiple fusion architectures.
To assess performance over replicated training runs on the TR60/TE18 benchmark.

Main Methods:

Utilized two large pretrained RNA language models: RNA-FM and RiNALMo.
Implemented multiple fusion architectures, including Reverse Cross-Attention and simple concat fusion.
Performed replicated training runs on the TR60/TE18 benchmark dataset.
Analyzed performance using Matthews Correlation Coefficient (MCC) and mean accuracy with standard deviation.

Main Results:

A Peak-SOTA Paradox was observed, where a single-seed run (MCC 0.353) surpassed reported state-of-the-art, while multi-seed replication yielded a lower average (0.266 ± 0.020), a 32.8% overestimation.
Mean accuracy was consistent across architectures, but reproducibility varied significantly.
Simple concat fusion strategies showed higher stability than attention-based models under data scarcity.
Single-seed evaluations can overstate expected performance by 20-30% in limited-sample regimes.

Conclusions:

Reproducibility should be a primary evaluation criterion for small-sample molecular prediction.
A dual-reporting standard is motivated: mean ± SD as the principal metric and peak scores as supplementary evidence.
Architectural choices, not just parameter count, influence variance in low-data scenarios.
Variance-aware evaluation is essential to avoid misrepresenting model performance in limited-sample settings.