Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

dTagger: a POS tagger.

Guy Divita1, Allen C Browne, Russell Loane

  • 1National Library of Medicine, Bethesda, Maryland, USA.

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|January 24, 2007
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An ecological model in support of an ontology of mental functioning.

PLOS mental health·2026
Same author

Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features.

Knowledge-based systems·2023
Same author

Development of an ontology to characterize mental functioning.

Disability and rehabilitation·2023
Same author

Extracting body function information using rule-based methods: Highlighting structure and formatting challenges in clinical text.

Frontiers in digital health·2022
Same author

Informatics Research on Mental Health Functioning: Decision Support for the Social Security Administration Disability Program.

Psychiatric services (Washington, D.C.)·2022
Same author

Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.

Journal of the American Medical Informatics Association : JAMIA·2020
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

The National Library of Medicine developed dTagger, a Part-of-Speech (POS) tagger for natural language processing (NLP) tools. This trainable tagger achieves 95% accuracy on biomedical text, enhancing information extraction.

Area of Science:

  • Natural Language Processing (NLP)
  • Computational Linguistics
  • Biomedical Informatics

Background:

  • The National Library of Medicine's Lexical Systems Group developed a Part-of-Speech (POS) tagger.
  • The tagger, named dTagger, is part of the SPECIALIST Natural Language Processing (NLP) Tools.

Purpose of the Study:

  • To develop and distribute a versatile POS tagger for use with the SPECIALIST lexicon or arbitrary tag sets.
  • To enhance the processing of biomedical literature through accurate linguistic annotation.

Main Methods:

  • dTagger is designed for single or multi-word chunking and is trainable with annotated text.
  • It incorporates shape identification and suffix statistics for handling unknown words.
  • Users can add local lexicon content and receive sentence tagging likelihoods.

Related Experiment Videos

Main Results:

  • Supervised training achieved 95% performance on a modified MedPost corpus of hand-annotated Medline abstracts.
  • Eight percent of terms within the corpus were identified as multi-word entities.
  • The tagger effectively handles unknown words through shape identification and statistical heuristics.

Conclusions:

  • dTagger offers a high-performance, adaptable solution for Part-of-Speech tagging in biomedical NLP.
  • Its capabilities in handling multi-word entities and unknown terms improve the analysis of complex scientific text.
  • The free distribution of dTagger promotes advancements in biomedical text mining and information retrieval.