Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

15.2K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
15.2K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.7K
3.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Random-with-constraints: Constructing minimal models for high-dimensional biology.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Emergent frequency-dependent selection predicts mutation outcomes in complex ecological communities.

bioRxiv : the preprint server for biology·2026
Same author

Ecology of metagenomes: incorporating genotype-to-phenotype maps into ecological models.

bioRxiv : the preprint server for biology·2026
Same author

From genes to collective modes: biological constraints shape metabolic evolution.

bioRxiv : the preprint server for biology·2026
Same author

Inferring genotype-phenotype maps using attention models.

PNAS nexus·2026
Same author

Bidirectional fibrogenic cross-talk revealed in a human iPSC-derived epithelial-mesenchymal co-culture model of pulmonary fibrosis.

bioRxiv : the preprint server for biology·2026
Same journal

Complex Indel Detection: A Simulation-Based Framework and Parsing with FreeBayes.

bioRxiv : the preprint server for biology·2026
Same journal

Emulating the gingival-tooth interface during bacterial, fungal, and viral infection in a microphysiological model of the human oral cavity.

bioRxiv : the preprint server for biology·2026
Same journal

Local SNP-explained methylation variation reveals genetically anchored and exposure-associated methylation architecture in the human brain.

bioRxiv : the preprint server for biology·2026
Same journal

Perinatal Semaglutide Treatment Improves Maternal Health and Mitigates Offspring Metabolic Dysfunction in a Mouse Model of Maternal Obesity.

bioRxiv : the preprint server for biology·2026
Same journal

Pervasive cryptic selection in the human noncoding genome.

bioRxiv : the preprint server for biology·2026
Same journal

Secreted ORF8 reprograms macrophages to enhance SARS-CoV-2 infection of lung epithelial cells.

bioRxiv : the preprint server for biology·2026
See all related articles
  1. Home
  2. Parameter-free Representations Outperform Single-cell Foundation Models On Downstream Benchmarks.
  1. Home
  2. Parameter-free Representations Outperform Single-cell Foundation Models On Downstream Benchmarks.

Related Experiment Video

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

19.2K

Parameter-free representations outperform single-cell foundation models on downstream benchmarks.

Huan Souza1, Pankaj Mehta1,2

  • 1Department of Physics, Boston University, Boston, MA, 02215, USA.

Biorxiv : the Preprint Server for Biology
|February 23, 2026

View abstract on PubMed

Summary
This summary is machine-generated.

Simple linear models can match complex foundation models for analyzing single-cell RNA sequencing (scRNA-seq) data. This research shows interpretable methods achieve state-of-the-art results, even on novel cell types and organisms.

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Related Experiment Videos

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

19.2K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Area of Science:

  • Computational Biology
  • Genomics
  • Bioinformatics

Background:

  • Single-cell RNA sequencing (scRNA-seq) data possesses inherent statistical structure, driving the development of advanced foundation models.
  • Transformer-based models like TranscriptFormer learn gene expression patterns by embedding genes into latent spaces, achieving state-of-the-art (SOTA) results in various biological tasks.

Purpose of the Study:

  • To investigate if SOTA performance in analyzing scRNA-seq data can be achieved using computationally efficient, interpretable methods instead of complex deep learning representations.
  • To evaluate the efficacy of simple normalization and linear modeling pipelines against established foundation models.

Main Methods:

  • Development of interpretable pipelines utilizing careful data normalization techniques.
  • Application of linear methods for gene expression data analysis.
  • Benchmarking against SOTA foundation models on established datasets and out-of-distribution tasks.
  • Main Results:

    • Simple linear pipelines achieved SOTA or near-SOTA performance across multiple benchmarks for scRNA-seq data analysis.
    • These methods outperformed foundation models on out-of-distribution tasks, including novel cell types and organisms not present in training data.
    • Demonstrated that linear representations can effectively capture the biology of cell identity.

    Conclusions:

    • Computationally intensive deep learning representations are not always necessary for achieving high performance in scRNA-seq data analysis.
    • Rigorous benchmarking is crucial for evaluating the true capabilities of computational methods in genomics.
    • Interpretable, linear models offer a powerful and efficient alternative for understanding cell identity from gene expression data.