Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Evaluating UMLS strings for natural language processing.

A T McCray1, O Bodenreider, J D Malley

  • 1National Library of Medicine, National Institutes of Health, USA.

Proceedings. AMIA Symposium
|February 5, 2002
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

On Contributing to the Progress of Medical Informatics as Publisher.

Yearbook of medical informatics·2017
Same author

Informatics Research, Development, and Training at the Lister Hill National Center for Biomedical Communications.

Yearbook of medical informatics·2016
Same author

Computing and Collaborative Care.

Yearbook of medical informatics·2016
Same author

Integration of Information for Patient Care.

Yearbook of medical informatics·2016
Same author

Health Informatics and the Internet.

Yearbook of medical informatics·2016
Same author

Patient-centered Systems.

Yearbook of medical informatics·2016
Same journal

Progressive display of very high resolution images using wavelets.

Proceedings. AMIA Symposium·2002
Same journal

The Chronus II temporal database mediator.

Proceedings. AMIA Symposium·2002
Same journal

Gene expression levels in different stages of progression in oral squamous cell carcinoma.

Proceedings. AMIA Symposium·2002
Same journal

An assessment of the visibility of MeSH-indexed medical web catalogs through search engines.

Proceedings. AMIA Symposium·2002
Same journal

Filtering for medical news items using a machine learning approach.

Proceedings. AMIA Symposium·2002
Same journal

Enriching the structure of the UMLS semantic network.

Proceedings. AMIA Symposium·2002
See all related articles

This study evaluates strings from the Unified Medical Language System (UMLS) for natural language processing (NLP) utility. Findings show specific string properties can predict their presence in biomedical corpora like MEDLINE.

Area of Science:

  • Biomedical Informatics
  • Natural Language Processing
  • Computational Linguistics

Background:

  • The Unified Medical Language System (UMLS) is a key resource for biomedical knowledge.
  • UMLS is utilized in various research and development applications, notably in Natural Language Processing (NLP).

Purpose of the Study:

  • To investigate the characteristics of strings within the UMLS Metathesaurus.
  • To assess the utility of these strings for NLP tasks.
  • To develop predictive models for string presence in biomedical corpora.

Main Methods:

  • Identification of string properties predictive of corpus presence.
  • Application of a statistical model to test these predictors.
  • Corpus derived from the MEDLINE database.

Related Experiment Videos

Main Results:

  • One set of properties achieved 77% accuracy in predicting strings absent from the corpus and 85% accuracy for strings present.
  • A second set of properties demonstrated 96% accuracy for predicting absent strings and 29% accuracy for present strings.

Conclusions:

  • String properties can effectively predict their relevance and presence within biomedical text corpora.
  • This predictive capability is valuable for optimizing NLP applications using the UMLS.