Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Random-with-constraints: Constructing minimal models for high-dimensional biology.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same author

Emergent frequency-dependent selection predicts mutation outcomes in complex ecological communities.

bioRxiv : the preprint server for biology·2026

Same author

Ecology of metagenomes: incorporating genotype-to-phenotype maps into ecological models.

bioRxiv : the preprint server for biology·2026

Same author

From genes to collective modes: biological constraints shape metabolic evolution.

bioRxiv : the preprint server for biology·2026

Same author

Inferring genotype-phenotype maps using attention models.

PNAS nexus·2026

Same author

Bidirectional fibrogenic cross-talk revealed in a human iPSC-derived epithelial-mesenchymal co-culture model of pulmonary fibrosis.

bioRxiv : the preprint server for biology·2026

Same journal

Complex Indel Detection: A Simulation-Based Framework and Parsing with FreeBayes.

bioRxiv : the preprint server for biology·2026

Same journal

Emulating the gingival-tooth interface during bacterial, fungal, and viral infection in a microphysiological model of the human oral cavity.

bioRxiv : the preprint server for biology·2026

Same journal

Local SNP-explained methylation variation reveals genetically anchored and exposure-associated methylation architecture in the human brain.

bioRxiv : the preprint server for biology·2026

Same journal

Perinatal Semaglutide Treatment Improves Maternal Health and Mitigates Offspring Metabolic Dysfunction in a Mouse Model of Maternal Obesity.

bioRxiv : the preprint server for biology·2026

Same journal

Pervasive cryptic selection in the human noncoding genome.

bioRxiv : the preprint server for biology·2026

Same journal

Secreted ORF8 reprograms macrophages to enhance SARS-CoV-2 infection of lung epithelial cells.

bioRxiv : the preprint server for biology·2026

See all related articles

Search research articles

Home
Parameter-free Representations Outperform Single-cell Foundation Models On Downstream Benchmarks.

Home
Parameter-free Representations Outperform Single-cell Foundation Models On Downstream Benchmarks.

Related Experiment Video

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

Parameter-free representations outperform single-cell foundation models on downstream benchmarks.

Huan Souza¹, Pankaj Mehta^1,2

¹Department of Physics, Boston University, Boston, MA, 02215, USA.

Biorxiv : the Preprint Server for Biology

|February 23, 2026

View abstract on PubMed

Summary

This summary is machine-generated.

Simple linear models can match complex foundation models for analyzing single-cell RNA sequencing (scRNA-seq) data. This research shows interpretable methods achieve state-of-the-art results, even on novel cell types and organisms.

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Related Experiment Videos

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Area of Science:

Computational Biology
Genomics
Bioinformatics

Background:

Single-cell RNA sequencing (scRNA-seq) data possesses inherent statistical structure, driving the development of advanced foundation models.
Transformer-based models like TranscriptFormer learn gene expression patterns by embedding genes into latent spaces, achieving state-of-the-art (SOTA) results in various biological tasks.

Purpose of the Study:

To investigate if SOTA performance in analyzing scRNA-seq data can be achieved using computationally efficient, interpretable methods instead of complex deep learning representations.
To evaluate the efficacy of simple normalization and linear modeling pipelines against established foundation models.

Main Methods:

Development of interpretable pipelines utilizing careful data normalization techniques.

Application of linear methods for gene expression data analysis.

Benchmarking against SOTA foundation models on established datasets and out-of-distribution tasks.

Main Results:

Simple linear pipelines achieved SOTA or near-SOTA performance across multiple benchmarks for scRNA-seq data analysis.
These methods outperformed foundation models on out-of-distribution tasks, including novel cell types and organisms not present in training data.
Demonstrated that linear representations can effectively capture the biology of cell identity.

Conclusions:

Computationally intensive deep learning representations are not always necessary for achieving high performance in scRNA-seq data analysis.
Rigorous benchmarking is crucial for evaluating the true capabilities of computational methods in genomics.
Interpretable, linear models offer a powerful and efficient alternative for understanding cell identity from gene expression data.