Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Using text analysis to identify functionally coherent gene groups.

Soumya Raychaudhuri1, Hinrich Schütze, Russ B Altman

  • 1Department of Genetics, Stanford Medical Informatics, University, Stanford, California 94305-5479, USA.

Genome Research
|October 9, 2002
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

TikTok is a valuable data source for tracking the opioid crisis.

NPJ digital medicine·2026
Same author

Drug-Target Interaction Prediction with PIGLET.

bioRxiv : the preprint server for biology·2026
Same author

GATSBI: Improving context-aware protein embeddings through biologically motivated data splits.

bioRxiv : the preprint server for biology·2026
Same author

Biological data governance in an age of AI.

Science (New York, N.Y.)·2026
Same author

The Human Omnibus of Targetable Pockets.

Journal of cheminformatics·2025
Same author

Publisher Correction: CRISPR-GPT for agentic automation of gene-editing experiments.

Nature biomedical engineering·2025
Same journal

Complete sequencing of medaka genomes reveals the architecture of centromeric satellites, giant mobile elements, and sex chromosomes.

Genome research·2026
Same journal

Convergence and conflict among telomere specialized transposons across 60 million years of Drosophilid evolution.

Genome research·2026
Same journal

A unified analysis of cell type- and trajectory-associated pathways in single-cell data using Phoenix.

Genome research·2026
Same journal

Resf1 is required for proper placental development and configuration of trophoblast cell-specific heterochromatin.

Genome research·2026
Same journal

Telomere-driven replicative crisis is driven by large-scale changes in genomic architecture.

Genome research·2026
Same journal

Spatially informed reference-free cell-type deconvolution for spatial transcriptomics with SpatialCD.

Genome research·2026
See all related articles

We developed neighbor divergence, a novel method using natural language processing to analyze scientific literature and identify functionally coherent gene groups. This approach effectively distinguishes biologically relevant gene clusters from random ones.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Genomic data analysis often involves clustering genes based on experimental features, leading to numerous groups.
  • Identifying functionally relevant gene clusters among these large sets can be challenging.
  • The scientific literature is a rich, yet underutilized, resource for understanding gene function.

Purpose of the Study:

  • To present a computational method, neighbor divergence, for assessing the functional coherence of gene groups using scientific literature.
  • To evaluate the efficacy of neighbor divergence in distinguishing known functional gene sets.

Main Methods:

  • Utilizes statistical natural language processing to interpret biological text from a relevant document corpus.
  • Assigns a numerical score to gene groups based on literature-derived functional coherence.

Related Experiment Videos

  • Requires a corpus of documents and an index linking documents to genes.
  • Main Results:

    • Neighbor divergence achieved 79% sensitivity at 100% specificity in distinguishing known functional gene groups from random sets.
    • The method performed favorably compared to other tested approaches.
    • Successfully applied to previously published gene expression clusters to validate its ability to recognize manually identified functional groups.

    Conclusions:

    • Neighbor divergence provides a robust, literature-based approach to assess gene group functional coherence.
    • This method aids in interpreting large-scale genomic data by leveraging the wealth of published biological knowledge.
    • Offers a valuable tool for researchers in genomics and bioinformatics to identify meaningful gene clusters.