Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conserved Binding Sites01:49

Conserved Binding Sites

5.3K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
5.3K
Conserved Binding Sites01:49

Conserved Binding Sites

2.0K
2.0K
Ligand Binding Sites02:40

Ligand Binding Sites

15.7K
Proteins are dynamic macromolecules that carry out a wide variety of essential processes; however, the activities of most proteins depend on their interactions with other molecules or ions, known as ligands.
Protein-ligand interactions are quite specific; even though numerous potential ligands surround a cellular protein at any given time, only a particular ligand can bind to that protein. Moreover, a ligand binds only to a dedicated area on the surface of the protein, known as the...
15.7K
Ligand Binding Sites02:40

Ligand Binding Sites

9.1K
9.1K
Leaky Scanning02:28

Leaky Scanning

5.9K
During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA.  Marilyn Kozak discovered that the sequence RCCAUGG (where R...
5.9K
Improving Translational Accuracy02:07

Improving Translational Accuracy

15.5K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
15.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

STE-DC2I Uncovers Driver Genes in Colorectal Cancer Subtypes Using Symbolic Trajectory-Embedded Dark Causal Inference.

Journal of chemical information and modeling·2026
Same author

LLM-Enhanced Knowledge Distillation for Sequence-Based Protein-Ligand Interaction Prediction.

IEEE journal of biomedical and health informatics·2026
Same author

Robust graph structure learning to improve multi-omics cancer subtype classification.

BMC bioinformatics·2026
Same author

Draft genome sequences of three <i>Streptomyces</i> sp. strains isolated from marine sponges in Kochi Prefecture, Japan.

Microbiology resource announcements·2026
Same author

Revealing intra-group immunotherapy response heterogeneity in metastatic urothelial carcinoma through interpretable feature extraction and spectral clustering.

Frontiers in immunology·2026
Same author

Correction: Anomaly detection in double-entry bookkeeping data by federated learning system with non-model sharing approach.

Scientific reports·2026
Same journal

AdaWGAN: Data Augmentation for Few-Shot HD-sEMG Gesture Recognition Using Single-Trial Data.

IEEE journal of biomedical and health informatics·2026
Same journal

NeuroBooster: a domain-informed self-supervised learning paradigm tailored for brain MRI analysis.

IEEE journal of biomedical and health informatics·2026
Same journal

Graph Convolutional Neural Network based Depression Detection using Brain Functional Connectivity Measures.

IEEE journal of biomedical and health informatics·2026
Same journal

Improving Multi-Sensor Non-Invasive Glucose Detection through AI: A Domain Generalization Approach.

IEEE journal of biomedical and health informatics·2026
Same journal

Unmixing the Neck: Accurate Jugular Venous Pulse Detection From Wearable PPG.

IEEE journal of biomedical and health informatics·2026
Same journal

AD-DAE: Alzheimer's Disease Progression Modeling with Unpaired Longitudinal MRI using Diffusion Auto-Encoders.

IEEE journal of biomedical and health informatics·2026
See all related articles

Related Experiment Video

Updated: Mar 29, 2026

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells
10:34

Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

Published on: December 9, 2022

5.4K

A Dual-Language-Model Framework for Reproducibility in Small Molecule-RNA Binding Site Prediction.

Shixuan Guan, Xiucai Ye, Tetsuya Sakurai

    IEEE Journal of Biomedical and Health Informatics
    |March 27, 2026
    PubMed
    Summary
    This summary is machine-generated.

    Single-seed evaluations in molecular learning can overestimate performance. Reproducibility analysis for RNA-ligand binding site prediction shows multi-seed runs are crucial for accurate performance metrics.

    More Related Videos

    Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins
    11:34

    Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins

    Published on: August 9, 2019

    7.2K
    Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA
    10:21

    Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

    Published on: February 23, 2024

    3.9K

    Related Experiment Videos

    Last Updated: Mar 29, 2026

    Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells
    10:34

    Probing RNA Structure with Dimethyl Sulfate Mutational Profiling with Sequencing In Vitro and in Cells

    Published on: December 9, 2022

    5.4K
    Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins
    11:34

    Exploring Sequence Space to Identify Binding Sites for Regulatory RNA-Binding Proteins

    Published on: August 9, 2019

    7.2K
    Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA
    10:21

    Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

    Published on: February 23, 2024

    3.9K

    Area of Science:

    • Computational biology
    • Machine learning in drug discovery
    • Bioinformatics

    Background:

    • Single-seed evaluation is common in small-dataset molecular learning but can inflate performance estimates.
    • Reproducibility in RNA-ligand binding site prediction using large pretrained RNA language models is underexplored.

    Purpose of the Study:

    • To conduct the first systematic reproducibility analysis for RNA-ligand binding site prediction.
    • To integrate two large pretrained RNA language models (RNA-FM and RiNALMo) across multiple fusion architectures.
    • To assess performance over replicated training runs on the TR60/TE18 benchmark.

    Main Methods:

    • Utilized two large pretrained RNA language models: RNA-FM and RiNALMo.
    • Implemented multiple fusion architectures, including Reverse Cross-Attention and simple concat fusion.
    • Performed replicated training runs on the TR60/TE18 benchmark dataset.
    • Analyzed performance using Matthews Correlation Coefficient (MCC) and mean accuracy with standard deviation.

    Main Results:

    • A Peak-SOTA Paradox was observed, where a single-seed run (MCC 0.353) surpassed reported state-of-the-art, while multi-seed replication yielded a lower average (0.266 ± 0.020), a 32.8% overestimation.
    • Mean accuracy was consistent across architectures, but reproducibility varied significantly.
    • Simple concat fusion strategies showed higher stability than attention-based models under data scarcity.
    • Single-seed evaluations can overstate expected performance by 20-30% in limited-sample regimes.

    Conclusions:

    • Reproducibility should be a primary evaluation criterion for small-sample molecular prediction.
    • A dual-reporting standard is motivated: mean ± SD as the principal metric and peak scores as supplementary evidence.
    • Architectural choices, not just parameter count, influence variance in low-data scenarios.
    • Variance-aware evaluation is essential to avoid misrepresenting model performance in limited-sample settings.