Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 14, 2026

Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples
07:30

Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples

Published on: June 8, 2020

Statistically invalid classification of high throughput gene expression data.

Shahar Barbash1, Hermona Soreq

  • 1The Edmond & Lily Safra Center for Brain Sciences and the Department of Biological Chemistry at the Hebrew University of Jerusalem, Jerusalem, Israel.

Scientific Reports
|January 25, 2013
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Convergent blood-brain barrier breakdown in schizophrenia and autism spectrum disorders: a systematic review of preclinical animal models.

Fluids and barriers of the CNS·2026
Same author

Identifying maximally informative signal-aware representations of single-cell data using the information bottleneck.

Cell systems·2026
Same author

Conserved small RNA networks link inflammation to pain signaling in mice and men.

Translational psychiatry·2025
Same author

Context dependent role of miR-486 promoting neuroregeneration of primary sensory neurons downstream of interleukin-6 signal transducer.

Molecular therapy. Nucleic acids·2025
Same author

Botulinum Neurotoxins: History, Mechanism, and Applications. A Narrative Review.

Journal of neurochemistry·2025
Same author

The cholinergic synapses.

Handbook of clinical neurology·2025
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Many scientific studies use classification analysis with high throughput data, but over half contain a statistical error leading to biased results. This research identifies the error and offers valid methods to improve accuracy in gene expression classification.

Area of Science:

  • High throughput data analysis
  • Neuroscience
  • Genomics
  • Biostatistics

Background:

  • Classification analysis of high throughput data is crucial in neuroscience and biology for understanding health and disease.
  • Outcomes inform biochemical mechanisms, therapeutic targets, and diagnostics.
  • Gene expression data is frequently used for these classification tasks.

Purpose of the Study:

  • To identify and characterize a common statistical error in high throughput data classification.
  • To assess the scope and impact of this methodological error on scientific conclusions.
  • To provide statistically valid alternatives for accurate classification analysis.

Main Methods:

  • Screened 111 high-impact manuscripts utilizing gene expression classification.

More Related Videos

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

A Combinatorial Single-cell Approach to Characterize the Molecular and Immunophenotypic Heterogeneity of Human Stem and Progenitor Populations
09:34

A Combinatorial Single-cell Approach to Characterize the Molecular and Immunophenotypic Heterogeneity of Human Stem and Progenitor Populations

Published on: October 25, 2018

Related Experiment Videos

Last Updated: May 14, 2026

Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples
07:30

Optimization for Sequencing and Analysis of Degraded FFPE-RNA Samples

Published on: June 8, 2020

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

A Combinatorial Single-cell Approach to Characterize the Molecular and Immunophenotypic Heterogeneity of Human Stem and Progenitor Populations
09:34

A Combinatorial Single-cell Approach to Characterize the Molecular and Immunophenotypic Heterogeneity of Human Stem and Progenitor Populations

Published on: October 25, 2018

  • Analyzed the statistical methodologies employed in the selected studies.
  • Investigated the influence of experimental parameters on the identified error.
  • Main Results:

    • 53% of the screened studies (58 out of 111) employed a statistically invalid classification method.
    • This invalid method leads to an overestimation of classification accuracy, introducing bias.
    • The scope and impact of the error vary with different experimental parameters.

    Conclusions:

    • A significant proportion of recent high-impact studies rely on flawed statistical approaches for gene expression classification.
    • This prevalent error compromises the reliability of findings in basic biology and disease-related research.
    • Implementing statistically valid methods is essential to ensure accurate and unbiased classification results.