Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

What's missing in geographical parsing?

Milan Gritta1, Mohammad Taher Pilehvar1, Nut Limsopatham1

  • 1Language Technology Lab (LTL), Department of Theoretical and Applied Linguistics (DTAL), University of Cambridge, 9 West Road, Cambridge, CB3 9DP UK.

Language Resources and Evaluation
|July 2, 2019
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Generative language models exhibit social identity biases.

Nature computational science·2024
Same author

Semantic Speech Networks Linked to Formal Thought Disorder in Early Psychosis.

Schizophrenia bulletin·2023
Same author

A survey on clinical natural language processing in the United Kingdom from 2007 to 2022.

NPJ digital medicine·2022
Same author

BioCaster in 2021: automatic disease outbreaks detection from global news media.

Bioinformatics (Oxford, England)·2022
Same author

Exploiting document graphs for inter sentence relation extraction.

Journal of biomedical semantics·2022
Same author

A Conceptual Framework for Representing Events Under Public Health Surveillance.

Studies in health technology and informatics·2022
Same journal

The properties of panels in global comics: frequency and size of 76 K panels in 1,030 comics from 144 countries.

Language resources and evaluation·2026
Same journal

Towards a resource for multilingual lexicons: an MT assisted and human-in-the-loop multilingual parallel corpus with multi-word expression annotation.

Language resources and evaluation·2026
Same journal

OjibweMorph: an approachable finite-state transducer for Ojibwe (and beyond).

Language resources and evaluation·2026
Same journal

The narratives of war (NoW) corpus of written testimonies of the Russia-Ukraine war.

Language resources and evaluation·2025
Same journal

How different is different? Systematically identifying distribution shifts and their impacts in NER datasets.

Language resources and evaluation·2025
Same journal

Cross-linguistically consistent semantic and syntactic annotation of child-directed speech.

Language resources and evaluation·2025
See all related articles

Geoparsing, the conversion of place names to coordinates, faces challenges due to diverse language and ambiguity. This study analyzes leading geoparsers and introduces a new Wikipedia corpus to advance the field.

Area of Science:

  • Natural Language Processing (NLP)
  • Geospatial Information Systems (GIS)
  • Computational Linguistics

Background:

  • Geoparsing converts textual place names into geographical coordinates, crucial for applications like emergency response and social media analysis.
  • Current geoparsing methods struggle with linguistic diversity, place name ambiguity, metonymic language, and limited contextual understanding.
  • Existing research often lacks cross-comparison and relies on limited, non-public datasets.

Purpose of the Study:

  • To evaluate and analyze the performance of leading geoparsing tools.
  • To identify and detail the specific challenges hindering accurate geoparsing.
  • To address the scarcity of open-source corpora by releasing an automatically geotagged Wikipedia corpus.

Main Methods:

Keywords:
GeocodingGeoparsingGeotaggingNEDNELNERNLP

Related Experiment Videos

  • Analysis of multiple leading geoparsing tools.
  • Evaluation across diverse textual corpora.
  • Development of an automated geotagging process for a large-scale corpus.
  • Main Results:

    • Detailed performance analysis of leading geoparsers on various datasets.
    • Identification of key challenges including linguistic variation and contextual limitations.
    • Publication of a new, large-scale, automatically geotagged Wikipedia corpus for research.

    Conclusions:

    • Geoparsing remains a complex task requiring robust solutions for real-world applications.
    • The study highlights critical areas for improvement in geoparsing technology.
    • The released corpus will facilitate further research and development in the geoparsing domain.