Final Report on the German Clinical Reference Corpus 3000PA.

Udo Hahn (1), Luise Modersohn (1, 2), Jakob Faller (1, 3)

  • (1) Jena University Language & Information Engineering (JULIE) Lab, Friedrich-Schiller-Universität Jena, Jena, Germany.

Studies in Health Technology and Informatics | January 25, 2024
Summary
This summary is machine-generated.

Researchers developed a national clinical reference corpus (3000PA) and complementary sharable corpora for German clinical natural language processing. These resources support advanced data infrastructure for medical informatics.

Keywords:
Clinical text corpus, German language, annotation, clinical NLP

Area of Science:

  • Medical Informatics
  • Computational Linguistics
  • Natural Language Processing

Background:

  • The Medical Informatics Initiative (MII) is a large-scale German research program.
  • Development of robust data and software infrastructure is crucial for German-language clinical natural language processing.

Purpose of the Study:

  • To report on the development of a national clinical reference corpus (3000PA) and complementary sharable corpora.
  • To support the advancement of German clinical natural language processing.

Main Methods:

  • Developed 3000PA, a national clinical reference corpus built from patient records of three university hospitals.
  • Annotated 3000PA with semantic layers: medical named entities, relations, certainty, and negation.
  • Created three sharable corpora: JSYNCC, GGPONC, and GRASCCO.
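
As a rough illustration only, the semantic layers listed above (medical named entities, relations, certainty, and negation) could be held as stand-off annotations over a document text. The class names, label values, and the German example sentence in this sketch are hypothetical and are not taken from the 3000PA annotation scheme.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Entity:
    """A named-entity span with certainty and negation attributes."""
    start: int                   # character offset, inclusive
    end: int                     # character offset, exclusive
    label: str                   # hypothetical label, e.g. "Symptom"
    certainty: str = "certain"   # hypothetical values: "certain", "suspected"
    negated: bool = False


@dataclass(frozen=True)
class Relation:
    """A labeled, directed link between two entities (by list index)."""
    head: int
    tail: int
    label: str


@dataclass
class AnnotatedDocument:
    text: str
    entities: List[Entity] = field(default_factory=list)
    relations: List[Relation] = field(default_factory=list)

    def span_text(self, entity: Entity) -> str:
        """Return the surface string covered by an entity."""
        return self.text[entity.start:entity.end]

    def negated_entities(self) -> List[str]:
        """Return surface strings of all entities marked as negated."""
        return [self.span_text(e) for e in self.entities if e.negated]


def make_entity(text: str, surface: str, label: str, **attrs) -> Entity:
    """Locate the first occurrence of `surface` and build an Entity for it."""
    start = text.index(surface)
    return Entity(start, start + len(surface), label, **attrs)


# Invented example sentence, not drawn from any corpus.
text = "Kein Fieber. Verdacht auf Pneumonie, Therapie mit Amoxicillin."
doc = AnnotatedDocument(text)
doc.entities = [
    make_entity(text, "Fieber", "Symptom", negated=True),
    make_entity(text, "Pneumonie", "Diagnosis", certainty="suspected"),
    make_entity(text, "Amoxicillin", "Medication"),
]
# Relation layer: the medication (index 2) is linked to the diagnosis (index 1).
doc.relations = [Relation(head=2, tail=1, label="treats")]

print(doc.negated_entities())
```

Keeping the annotations as character offsets rather than inline markup leaves the source text untouched, so several layers can refer to the same spans independently.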

Main Results:

  • The 3000PA corpus carries semantic annotations for medical named entities, relations, certainty, and negation.
  • Three sharable corpora (JSYNCC, GGPONC, GRASCCO) complement 3000PA.
  • Combined corpora (3000PA, JSYNCC, GRASCCO) feature approximately 2.1 million metadata points.

Conclusions:

  • The developed corpora provide a valuable resource for German clinical natural language processing research.
  • These resources enhance the data and software infrastructure for medical informatics in Germany.