SCassist: An AI Based Workflow Assistant for Single-Cell Analysis
- Vijayaraj Nagarajan 1, Guangpu Shi 1, Samyuktha Arunkumar 1, Chunhong Liu 2, Jaanam Gopalakrishnan 3, Pulak R Nath 4, Junseok Jang 1, Rachel R Caspi 1
- 1Laboratory of Immunology, National Eye Institute, NIH, Bethesda 20892, USA.
- 2Neuro-Immune Regulome Unit (Alumni), National Eye Institute, NIH, Bethesda 20892, USA.
- 3Neuro-Immune Regulome Unit, National Eye Institute, NIH, Bethesda 20892, USA.
- 4Laboratory of Immunology (Alumni), National Eye Institute, NIH, Bethesda 20892, USA.
- 0Laboratory of Immunology, National Eye Institute, NIH, Bethesda 20892, USA.
Related Experiment Videos
Contact us if these videos are not relevant.
Contact us if these videos are not relevant.
View abstract on PubMed
Summary
This summary is machine-generated.SCassist simplifies complex single-cell RNA sequencing (scRNA-seq) analysis using large language models (LLMs). This R package offers guided recommendations and interpretations, making advanced scRNA-seq data analysis more accessible.
Area Of Science
- Genomics
- Bioinformatics
- Computational Biology
Background
- Single-cell RNA sequencing (scRNA-seq) analysis is a complex, multi-step process demanding significant bioinformatics expertise and time.
- Existing workflows often present challenges for researchers, limiting accessibility and efficiency in biological data interpretation.
Purpose Of The Study
- To develop an R package, SCassist, that integrates large language models (LLMs) to streamline and enhance scRNA-seq data analysis.
- To provide researchers with intelligent, LLM-driven guidance for critical analysis steps and interpretation of results.
Main Methods
- Developed SCassist, an R package utilizing LLMs (Google's Gemini, OpenAI's GPT, Meta's Llama3) for scRNA-seq analysis.
- Integrated LLM-powered recommendations for data filtering, normalization, and clustering parameters.
- Implemented LLM-guided interpretation of variable features, principal components, cell type annotation, and enrichment analysis.
Main Results
- SCassist provides automated, data-driven recommendations for optimizing scRNA-seq analysis parameters.
- The package offers insightful, LLM-generated interpretations of complex genomic data, including feature significance and cell population identification.
- Demonstrated enhanced accessibility of sophisticated scRNA-seq analysis for researchers across various experience levels.
Conclusions
- SCassist significantly reduces the complexity and time required for scRNA-seq data analysis.
- Leveraging LLMs in bioinformatics tools like SCassist democratizes advanced genomic data interpretation.
- The R package empowers researchers to conduct more robust and accessible scRNA-seq studies.
Related Experiment Videos
Contact us if these videos are not relevant.
Contact us if these videos are not relevant.

