Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Colonisation of Pathogens01:25

Colonisation of Pathogens

Pathogen colonization of host tissues is a critical step in the development of infectious diseases. Various pathogenic microorganisms, including bacteria, fungi, viruses, and protozoa, have evolved complex strategies to attach to, invade, and persist within host environments. These mechanisms enable pathogens to establish infections, evade immune responses, and resist antimicrobial treatments.Attachment to Host CellsIn bacteria, colonization typically begins with adherence to host epithelial...
Infection01:20

Infection

When a pathogen enters the body and reproduces, it can cause an infection, damage body cells, and cause illness symptoms that eventually lead to disease. Therefore, its prevention requires breaking the chain of infection.
The chain begins with pathogens: bacteria, viruses, fungi, prions, or parasites such as protozoa helminths. These can be present on the skin as transient or resident flora, or they can be acquired from the environment. Identifying and treating the type of infection and...
Antigen Processing Pathways01:31

Antigen Processing Pathways

MHC molecules are key players in the immune response, enabling T cells to recognize and respond to specific antigens. They are present on the surface of all nucleated cells in the body and are instrumental in presenting antigens to T cells and activating them. T cells recognize the MHC-antigen complex and initiate an immune response. MHC class I and MHC class II are two main types of MHC molecules, each associated with a distinct antigen processing pathway.
MHC Class I: Presenting Endogenous...
Introduction to Language of Pathophysiology l01:25

Introduction to Language of Pathophysiology l

Pathophysiology investigates how biological mechanisms—typically starting at the cellular level—disrupt normal bodily functions. It bridges anatomy and physiology to explain the progression of disease. With this foundation, it is important to understand the following key terms used to describe disease processes: Diagnosis:The process of identifying a disease using clinical evaluation, including signs (objective evidence like rashes), symptoms (subjective experiences like pain), laboratory test...
Infectious Diseases and Their Occurrence01:28

Infectious Diseases and Their Occurrence

Infectious diseases appear in populations through various transmission patterns, influenced by pathogen characteristics, population immunity, environmental conditions, and social behavior. Understanding these patterns is essential for effective public health surveillance and intervention. These categories—sporadic, outbreak, epidemic, pandemic, and endemic—help frame the nature and scope of disease events.Sporadic diseases occur irregularly and infrequently, without a predictable temporal or...
Introduction to Language of Pathophysiology ll01:17

Introduction to Language of Pathophysiology ll

This lesson explores key terms that describe how diseases progress, their outcomes, and their distribution in populations.Diagnostic tests identify diseases and monitor treatment. These include blood and urine tests, biopsies, imaging (X-ray, MRI), and detection of infectious agents.Remission is a reduction or disappearance of symptoms.Exacerbation refers to the worsening of symptoms, such as increased wheezing during an asthma attack.A precipitating factor triggers an acute episode, while a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Temporal Annotation of German Clinical Language in Real and Synthetic Clinical Documents: Corpus Development and Baseline Tagger Validation Study.

Journal of medical Internet research·2026
Same author

GeMTeX's De-Identification in Action: Lessons Learned & Devil's Details.

Studies in health technology and informatics·2025
Same author

Clinical document corpora-real ones, translated and synthetic substitutes, and assorted domain proxies: a survey of diversity in corpus design, with focus on German text data.

JAMIA open·2025
Same author

De-Identifying GRASCCO - A Pilot Study for the De-Identification of the German Medical Text Project (GeMTeX) Corpus.

Studies in health technology and informatics·2024
Same author

Final Report on the German Clinical Reference Corpus 3000PA.

Studies in health technology and informatics·2024
Same author

Influence of Context in Transformer-Based Medication Relation Extraction.

Studies in health technology and informatics·2024
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

Related Experiment Video

Updated: May 15, 2026

Using a Bacterial Pathogen to Probe for Cellular and Organismic-level Host Responses
08:38

Using a Bacterial Pathogen to Probe for Cellular and Organismic-level Host Responses

Published on: February 22, 2019

Active Learning-based corpus annotation--the PathoJen experience.

Udo Hahn1, Elena Beisswanger, Ekaterina Buyko

  • 1Jena University Language & Information Engineering (JULIE) Lab, Friedrich-Schiller-Universität Jena, Fürstengraben 30, D-07743 Jena, Germany. udo.hahn@uni-jena.de

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|January 11, 2013
PubMed
Summary
This summary is machine-generated.

This study introduces PathoJen, a new corpus for annotating pathological phenomena in medical texts. It uses a novel two-category system and active learning to improve disease entity recognition efficiently.

More Related Videos

A Visual Guide for Studying Behavioral Defenses to Pathogen Attacks in Leaf-Cutting Ants
08:10

A Visual Guide for Studying Behavioral Defenses to Pathogen Attacks in Leaf-Cutting Ants

Published on: October 12, 2018

A Mouse Model for Pathogen-induced Chronic Inflammation at Local and Systemic Sites
09:52

A Mouse Model for Pathogen-induced Chronic Inflammation at Local and Systemic Sites

Published on: August 8, 2014

Related Experiment Videos

Last Updated: May 15, 2026

Using a Bacterial Pathogen to Probe for Cellular and Organismic-level Host Responses
08:38

Using a Bacterial Pathogen to Probe for Cellular and Organismic-level Host Responses

Published on: February 22, 2019

A Visual Guide for Studying Behavioral Defenses to Pathogen Attacks in Leaf-Cutting Ants
08:10

A Visual Guide for Studying Behavioral Defenses to Pathogen Attacks in Leaf-Cutting Ants

Published on: October 12, 2018

A Mouse Model for Pathogen-induced Chronic Inflammation at Local and Systemic Sites
09:52

A Mouse Model for Pathogen-induced Chronic Inflammation at Local and Systemic Sites

Published on: August 8, 2014

Area of Science:

  • Natural Language Processing
  • Medical Informatics
  • Computational Linguistics

Background:

  • Accurate identification of pathological phenomena in biomedical literature is crucial for medical research and clinical applications.
  • Existing annotation guidelines struggle to capture the nuances of disease mentions, especially less structured descriptive statements.
  • Developing robust named entity recognition (NER) systems for pathology requires high-quality annotated data.

Purpose of the Study:

  • To present the design decisions and annotation procedures for PathoJen, a novel corpus of Medline abstracts annotated for pathological phenomena.
  • To introduce a two-category encoding schema to differentiate between standardized disease terminology and descriptive pathological statements.
  • To implement an Active Learning-based sampling strategy to optimize annotation cost and quality.

Main Methods:

  • Development of PathoJen corpus using Medline abstracts.
  • Implementation of a two-category encoding schema: short mentions for standardized terms (e.g., diseases) and long mentions for descriptive pathological statements.
  • Application of an Active Learning approach for intelligent sampling of annotation instances to reduce costs and improve model performance.

Main Results:

  • PathoJen corpus successfully annotated for pathological phenomena, including diseases.
  • The two-category encoding schema effectively distinguishes between different types of pathological mentions.
  • Active Learning strategy demonstrated efficiency in annotation by focusing on challenging instances, maintaining quality.

Conclusions:

  • PathoJen provides a valuable resource for training and evaluating NER systems for pathological entities.
  • The proposed annotation schema and Active Learning approach offer a cost-effective method for building specialized biomedical corpora.
  • This work advances the field of biomedical NLP by improving the capture of complex pathological information.