Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

10.3K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Non-Target Effects of <i>Trichoderma</i> Spore Suspensions and Secondary Metabolites on Phytoseiid Predatory Mites.

Journal of fungi (Basel, Switzerland)·2026
Same author

Push-pull strategy that combines olfactory and visual cues for management of ambrosia beetles (Coleoptera: Curculionidae: Scolytinae and Platypodinae) in avocado.

Environmental entomology·2026
Same author

A layered standards framework for integrating single-cell and spatial omics data into brain cell atlases.

bioRxiv : the preprint server for biology·2026
Same author

A three-dimensional printed laryngoscope holding helmet for continuous laryngoscopy during exercise in a pediatric exercise induced laryngeal obstruction clinic.

International journal of pediatric otorhinolaryngology·2026
Same author

Prospects for biological control of wood-boring insects using phoretic mites.

Frontiers in plant science·2026
Same author

The Cell Ontology in the age of single-cell omics.

Scientific data·2026
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Aug 27, 2025

Author Spotlight: Deciphering the Cellular Mysteries of Intermuscular Adipose Tissue in Humans
05:59

Author Spotlight: Deciphering the Cellular Mysteries of Intermuscular Adipose Tissue in Humans

Published on: May 3, 2024

808

Machine learning for cell type classification from single nucleus RNA sequencing data.

Huy Le1, Beverly Peng1, Janelle Uy1

  • 1Department of Bioengineering, University of California, San Diego, CA, United States of America.

Plos One
|September 23, 2022
PubMed
Summary
This summary is machine-generated.

Supervised machine learning improves cell type classification from single-nucleus RNA sequencing (snRNA-seq) data. Feature selection and multinomial logistic regression were key to accurately categorizing cells, overcoming challenges in distinguishing similar cell types.

More Related Videos

Isolation of Adult Spinal Cord Nuclei for Massively Parallel Single-nucleus RNA Sequencing
06:38

Isolation of Adult Spinal Cord Nuclei for Massively Parallel Single-nucleus RNA Sequencing

Published on: October 12, 2018

19.0K
Single-cell RNA-Seq of Defined Subsets of Retinal Ganglion Cells
11:26

Single-cell RNA-Seq of Defined Subsets of Retinal Ganglion Cells

Published on: May 22, 2017

13.9K

Related Experiment Videos

Last Updated: Aug 27, 2025

Author Spotlight: Deciphering the Cellular Mysteries of Intermuscular Adipose Tissue in Humans
05:59

Author Spotlight: Deciphering the Cellular Mysteries of Intermuscular Adipose Tissue in Humans

Published on: May 3, 2024

808
Isolation of Adult Spinal Cord Nuclei for Massively Parallel Single-nucleus RNA Sequencing
06:38

Isolation of Adult Spinal Cord Nuclei for Massively Parallel Single-nucleus RNA Sequencing

Published on: October 12, 2018

19.0K
Single-cell RNA-Seq of Defined Subsets of Retinal Ganglion Cells
11:26

Single-cell RNA-Seq of Defined Subsets of Retinal Ganglion Cells

Published on: May 22, 2017

13.9K

Area of Science:

  • Genomics
  • Computational Biology
  • Bioinformatics

Background:

  • Single-cell/nucleus RNA sequencing (sc/snRNA-seq) generates high-resolution transcriptomic data for cell phenotyping.
  • Classifying cells into specific types using sc/snRNA-seq data is challenging due to transcriptional similarities between related cell types and data sparsity.
  • Matching cell types across different sc/snRNA-seq experiments remains a significant hurdle.

Purpose of the Study:

  • To evaluate supervised machine learning methods for robust cell type classification using snRNA-seq data.
  • To identify optimal machine learning models and feature selection strategies for granular cell type identification.
  • To address challenges in distinguishing closely related cell types and improving cross-experiment cell type matching.

Main Methods:

  • Applied supervised machine learning algorithms including logistic regression, support vector machines, random forests, neural networks, and LightGBM.
  • Utilized snRNA-seq datasets from human brain middle temporal gyrus (MTG) and human kidney.
  • Evaluated classification performance using an F-beta score, emphasizing precision to mitigate gene expression dropout effects.
  • Investigated the impact of hyperparameter optimization and feature selection on model performance.

Main Results:

  • Multinomial logistic regression emerged as the best-performing model for granular cell type classification across both tested datasets.
  • An effective feature selection step was identified as the most critical factor for optimizing machine learning pipeline performance.
  • The chosen F-beta score weighting successfully accounted for technical artifacts like gene expression dropout.

Conclusions:

  • Supervised machine learning, particularly multinomial logistic regression coupled with robust feature selection, significantly enhances cell type classification accuracy from snRNA-seq data.
  • Feature selection is paramount for maximizing the performance of machine learning models in sc/snRNA-seq analysis.
  • These findings provide a data-driven approach to overcome limitations in current cell phenotyping techniques using sc/snRNA-seq.