PuMA: PubMed gene/cell type-relation Atlas
View abstract on PubMed
Summary
This summary is machine-generated.This study introduces the PubMed Gene/Cell type-Relation Atlas (PuMA), a tool for literature-driven cell type annotation. PuMA extracts gene and cell type information from PubMed, aiding researchers in automated cell type identification.
Area Of Science
- Bioinformatics
- Computational Biology
- Genomics
Background
- Automated cell type annotation is crucial for single-cell analysis.
- Existing tools rely on curated databases or machine learning, but incorporating literature knowledge is underexplored.
Purpose Of The Study
- To develop a tool for literature-driven cell type annotation using PubMed.
- To facilitate the extraction and visualization of gene-cell type relationships from scientific literature.
Main Methods
- Developed the PubMed Gene/Cell type-Relation Atlas (PuMA) with a web interface.
- Utilized a machine learning-based named entity recognition model to extract gene and cell type concepts from PubMed.
- Linked biomedical ontologies and developed a ranking score for gene-cell type relations.
Main Results
- PuMA provides a searchable interface with interactive graph visualizations for exploring gene-cell type relations.
- Results are traceable to specific PubMed articles.
- PuMA demonstrated competitive performance against a manually curated database across mouse and human datasets.
Conclusions
- PuMA enables automated cell type annotation using PubMed articles, complementing existing databases.
- The software framework supports incremental knowledge updates through regular article imports.
- PuMA offers researchers a valuable tool for analyzing and automating cell type annotation based on the latest scientific literature.
Related Concept Videos
Multicellular organisms contain a variety of structurally and functionally distinct cell types, but the DNA in all the cells originated from the same parent cells. The differences in the cells can be attributed to the differential gene expression. Liver cells, whose functions include detoxification of blood, production of bile to metabolize fats, and synthesis of proteins essential for metabolism, must express a specific set of genes to perform their functions. Gene expression also varies with...
The extent of chromatin compaction can be studied by staining chromatin using specific DNA binding dyes. Under the microscope, the dense-compacted regions that take up more dye are called heterochromatin. Heterochromatin is further classified into two forms – constitutive heterochromatin and facultative heterochromatin.
Constitutive heterochromatin: It is a highly compact region of chromatin that is mostly concentrated in the centromere and telomere. Unlike euchromatin, the amino acid at...
Cyclic Adenosine Monophosphate (cAMP) is an essential second messenger that activates protein kinase A (PKA) and regulates various biological processes. A single epinephrine molecule binds to GPCR and activates several heterotrimeric G proteins, each stimulating multiple adenylyl cyclase, amplifying the signal, and synthesizing large numbers of cAMP molecules. Small changes in cAMP concentration affect PKA activity. The binding of four cAMP molecules induces a conformational change in PKA,...
Membrane lipids such as phosphatidylinositol (PI) are precursors for several membrane-bound and soluble second messengers. Specific kinases phosphorylate PI and produce phosphorylated inositol phospholipids. One such inositol phospholipids are the phosphatidylinositol-4,5 bisphosphate [PI(4,5)P2], present in the inner half of the lipid bilayer. Upon ligand binding, GPCR stimulates Gq proteins to turn on phospholipase Cꞵ. Activated phospholipase Cꞵ cleaves PI(4,5)P2 and...
Some GPCRs transmit signals through adenylyl cyclase (AC), a transmembrane enzyme. AC helps synthesize second messenger cyclic adenosine monophosphate (cAMP). AC catalyzes cyclization reaction and converts ATP to cAMP by releasing a pyrophosphate. The pyrophosphate is further hydrolyzed to phosphate by the enzyme pyrophosphatase, which drives cAMP synthesis to completion. However, cAMP is rapidly degraded to 5′ AMP by the enzymes phosphodiesterase (PDE), preventing overstimulation of...
A cell line is a population of cells grown in vitro that can be subcultured over several generations. Normal cells cease to divide after a certain number of cell divisions, a process known as replicative senescence. This number, called the Hayflick limit, was conceptualized by Leonard Hayflick in 1961 when he observed that fetal cells grown in culture could only divide 40-60 times. This limit is due to the shortening of the telomeres during each round of cell division, preventing cell division...

