Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Finding scientific topics.

Thomas L Griffiths1, Mark Steyvers

  • 1Department of Psychology, Stanford University, Stanford, CA 94305, USA. gruffydd@psych.stanford.edu

Proceedings of the National Academy of Sciences of the United States of America
|February 12, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A folk taxonomy of magic.

Cognition·2026
Same author

A reporting checklist for large language models in behavioural science.

Nature human behaviour·2026
Same author

Resolving Feynman's restaurant problem reveals optimal solutions and human strategies.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Considering Psychological Mechanisms Can Change the Interpretation of Bayesian Models.

Topics in cognitive science·2026
Same author

Aha! moments correspond to metacognitive prediction errors.

Cognition·2026
Same author

Whither symbols in the era of advanced neural networks?

Trends in cognitive sciences·2026
Same journal

Tau protein as a regulator of mitochondrial function and dynamics.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

A scalable, dividing cell model for the robust propagation and quantification of human sporadic Creutzfeldt-Jakob disease prions.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Epigenetic regulation of mesenchymal BMP signaling directs postnatal organ innervation.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Single-shot wide-field biochemical imaging at 1 kHz frame rate.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Morphogenesis and topological evolution of a frustrated nematic liquid crystal under confinement.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

B cell-intrinsic CXCR3 drives efficient generation of ectopic pulmonary germinal center responses to influenza A virus infection.

Proceedings of the National Academy of Sciences of the United States of America·2026
See all related articles

This study introduces a generative document model and Markov chain Monte Carlo algorithm to identify topics within documents. The model successfully extracts meaningful topics from scientific abstracts, aiding content discovery.

Area of Science:

  • Computational Linguistics
  • Machine Learning
  • Information Retrieval

Background:

  • Document content identification is crucial for information retrieval.
  • Existing methods require robust topic modeling techniques.
  • Generative models offer a probabilistic approach to understanding document structure.

Purpose of the Study:

  • To describe a generative model for documents based on topic distributions.
  • To present a Markov chain Monte Carlo (MCMC) algorithm for model inference.
  • To apply the model to analyze scientific abstracts and identify underlying topics.

Main Methods:

  • Developed a generative probabilistic model where documents are characterized by topic distributions.
  • Employed a Markov chain Monte Carlo (MCMC) algorithm for efficient inference within the generative model.

Related Experiment Videos

  • Utilized Bayesian model selection to determine the optimal number of topics for document analysis.
  • Main Results:

    • The generative model successfully identified meaningful topics within PNAS abstracts.
    • Extracted topics demonstrated consistency with author-assigned article classifications.
    • The analysis revealed the model's capability to capture underlying semantic structure in scientific text.

    Conclusions:

    • The proposed generative model and MCMC inference provide an effective method for topic discovery in documents.
    • This approach facilitates the identification of "hot topics" through temporal analysis.
    • The model can be used for semantic tagging of abstracts, enhancing content discoverability.