Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

10.4K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrated transcriptomic analysis reveals lymphatic <i>Icam1</i>-mediated immune dynamics after myocardial infarction.

Zoological research·2026
Same author

A mechanobiological hypothesis on bone cement-induced progression of bone metastases.

Frontiers in bioengineering and biotechnology·2026
Same author

A Dual-Focus Workflow for Simultaneously Engineering High Thermostability of Aldo-Keto Reductase for the Degradation of 3-Keto-Deoxynivalenol.

Journal of agricultural and food chemistry·2026
Same author

Structural basis of NMI-IFP35 domains and swapping phenomenon in IFP35-NID.

Journal of structural biology: X·2026
Same author

Integrated Single-Cell and Spatial Analysis Reveals a Metabolic-Immune Axis Driving Aortic Dissection.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same author

Latent transition analysis of stigma and its association with treatment adherence in pulmonary tuberculosis patients during anti-tuberculosis therapy.

Frontiers in public health·2026
Same journal

Predicting piRNA-Disease Associations Based on Dual-View Learning and Multi-head Self-Attention Mechanism Fusion.

Interdisciplinary sciences, computational life sciences·2026
Same journal

DTANet+: Dual Interaction and Kernel-Diverse Network for Drug-Target Affinity Prediction.

Interdisciplinary sciences, computational life sciences·2026
Same journal

STNMAE: Identifying Spatial Domains from Spatial Transcriptomics Data with Neighbor-Aware Multi-view Masked Graph Autoencoder.

Interdisciplinary sciences, computational life sciences·2026
Same journal

Diagnosis and Prediction of Alzheimer's Disease via a High-Level Convolutional Block Attention Module-Residual Network.

Interdisciplinary sciences, computational life sciences·2026
Same journal

Deep3D-DTA: A Tri-Modal Deep Learning Framework for Binding Affinity Prediction Leveraging 3D Structural Representations of Drugs and Targets.

Interdisciplinary sciences, computational life sciences·2026
Same journal

ST-LDAW: A Topic-Model and Damped Weighted Least-Squares Method for Integrative Deconvolution of Single-Cell and Spatial Transcriptomics.

Interdisciplinary sciences, computational life sciences·2026
See all related articles
  1. Home
  2. Clustering Single-cell Rna-seq Data With Low-rank Matrix Factorization And Local Graph Regularization.
  1. Home
  2. Clustering Single-cell Rna-seq Data With Low-rank Matrix Factorization And Local Graph Regularization.

Related Experiment Video

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.7K

Clustering Single-Cell RNA-Seq Data with Low-Rank Matrix Factorization and Local Graph Regularization.

Yue Yu1, Wei Zhang2,3, Xiaoying Zheng4

  • 1School of Sciences, East China Jiaotong University, Nanchang, 330013, China.

Interdisciplinary Sciences, Computational Life Sciences
|September 2, 2025

View abstract on PubMed

Summary
This summary is machine-generated.

This study introduces a novel clustering algorithm, LRMGC, for single-cell RNA sequencing (scRNA-seq) data. LRMGC accurately identifies cell types by robustly handling noisy, high-dimensional data for improved biological insights.

Keywords:
ClusteringLocal graph regularizedLow rank representationSchatten p-normSingle-cell RNA sequencing

Frequently Asked Questions

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

892
Using R, Seurat, and CellChat to Analyze a Single-Cell Transcriptomics Dataset of Mouse Skin Wound Healing
08:58

Using R, Seurat, and CellChat to Analyze a Single-Cell Transcriptomics Dataset of Mouse Skin Wound Healing

Published on: August 1, 2025

488

Related Experiment Videos

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.7K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

892
Using R, Seurat, and CellChat to Analyze a Single-Cell Transcriptomics Dataset of Mouse Skin Wound Healing
08:58

Using R, Seurat, and CellChat to Analyze a Single-Cell Transcriptomics Dataset of Mouse Skin Wound Healing

Published on: August 1, 2025

488

Area of Science:

  • Computational biology and bioinformatics focusing on single-cell transcriptomics.
  • Machine learning applications in genomic data processing using low-rank matrix decomposition.
  • Statistical modeling of cellular heterogeneity through graph-based regularization techniques.

Background:

Single-cell Ribonucleic Acid sequencing (scRNA-seq) provides a high-resolution view of transcriptomic landscapes within individual cells. Prior research has shown that identifying distinct cell types is a fundamental step for interpreting biological diversity and disease mechanisms. Understanding the specific molecular signatures of individual cells allows scientists to map developmental trajectories and identify pathological changes. Existing computational frameworks often struggle with the inherent technical noise and extreme sparsity found in these massive datasets. Conventional Low-Rank Representation (LRR) techniques frequently fail to distinguish between biological signal and stochastic artifacts. These mathematical limitations often lead to the conflation of true biological patterns with experimental noise. This absence of evidence motivated the development of more robust mathematical frameworks to handle high-dimensional genomic inputs.

LRMGC integrates low-rank matrix decomposition with local graph regularization to isolate biological signals from technical noise. By applying a tri-decomposition strategy, the algorithm derives an aligned core matrix that accurately characterizes cellular distances in a lower-dimensional space, facilitating precise cell type identification.

The algorithm applies the Schatten p-norm to the core matrix instead of the traditional kernel norm. This specific mathematical choice allows the framework to robustly learn the similarity matrix against outliers while maintaining the underlying subspace structure of high-dimensional, sparse transcriptomic datasets.

The researchers utilized the angular alignment strategy on the similarity matrix to refine the final clustering output. This step ensures that the relationships between cells are geometrically consistent, which revealed more accurate cell type compositions during comprehensive experiments on diverse scRNA-seq datasets.

While highly effective for cell type identification, the study's findings are primarily validated through marker gene identification, functional enrichment, and cell-cell communication. The authors imply that the algorithm's performance is specifically optimized for addressing the high dimensionality, sparsity, and noise inherent in scRNA-seq data.

The study's authors propose that LRMGC provides a superior and reliable performance for uncovering cell type composition. They conclude that the effectiveness of this approach in rare cell recognition and functional enrichment analysis makes it a valuable tool for various downstream single-cell transcriptomic investigations.

Purpose Of The Study:

This research introduces a novel clustering algorithm termed Low-Rank Matrix decomposition with local Graph regularization (LRMGC) to enhance cell type identification. The investigators sought to overcome the limitations of standard kernel norms by implementing a more flexible Schatten p-norm approach. Refined similarity matrices must accurately reflect the underlying subspace structure of complex biological samples. The project focuses on integrating local manifold constraints to preserve the geometric relationships between individual cellular profiles. Robustness against outliers remains a primary objective for ensuring reliable downstream biological interpretations. This effort aims to provide a scalable solution for processing the high-dimensional noise characteristic of modern transcriptomic assays. Accurate classification of cellular subsets is essential for uncovering the mechanisms driving tissue heterogeneity.

Main Methods:

The LRMGC framework utilizes a tri-decomposition strategy applied to the representation matrix to extract an aligned core matrix. Researchers incorporated a local manifold regularization term to characterize the spatial distance between cells in a reduced dimensionality. Instead of standard mathematical constraints, the Schatten p-norm is applied to the core matrix to improve resistance to data outliers. An angular alignment strategy is subsequently executed on the similarity matrix to refine the final clustering output. Performance evaluation involved comprehensive comparisons against multiple advanced computational methods using diverse scRNA-seq datasets. The methodology also includes specific modules for marker gene identification and functional enrichment analysis to validate biological relevance. Rare cell recognition and cell-cell communication patterns were analyzed to test the versatility of the proposed algorithm across different biological contexts.

Main Results:

LRMGC demonstrated superior performance and reliability in uncovering cell type compositions compared to existing state-of-the-art algorithms. The application of the Schatten p-norm effectively preserved the subspace structure of high-dimensional noisy data. Experimental results confirmed that the tri-decomposition strategy successfully isolated the core biological signals from technical artifacts. The algorithm accurately identified rare cell populations that were often overlooked by less sensitive clustering techniques. Functional enrichment analysis and marker gene identification provided strong evidence for the biological accuracy of the clusters. Cell-cell communication analyses revealed intricate signaling networks that aligned with known physiological pathways. Statistical validation indicated that the similarity matrix learned by LRMGC was significantly more robust against outliers than those produced by kernel-norm-based methods.

Conclusions:

The integration of low-rank decomposition with local graph constraints provides a robust solution for single-cell transcriptomic analysis. These findings suggest that LRMGC can significantly improve the precision of downstream genomic investigations. Enhanced cell type identification facilitates a deeper understanding of cellular heterogeneity in complex biological systems. The researchers propose that this mathematical framework is highly effective for managing the sparsity and noise of scRNA-seq data. Future applications may involve applying this clustering approach to larger multi-omic datasets to reveal broader regulatory mechanisms. This computational tool offers a reliable foundation for discovering novel biomarkers and therapeutic targets in clinical research. Improved clustering reliability ensures that subsequent analyses, such as trajectory inference, are based on accurate cellular groupings.