Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Methods of Documentation VII: EMR01:30

Methods of Documentation VII: EMR

784
Electronic Medical Records (EMRs) primarily center around electronically documenting patients' health information within a single healthcare organization or practice. They contain essential clinical data related to a patient's medical history, diagnoses, medications, treatment plans, lab results, and other pertinent information relevant to the specific encounter or episode of care. EMRs are designed to streamline documentation and workflow processes within individual healthcare...
784

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Transfer times for patients with extradural and subdural haematomas to neurosurgery in Greater Manchester.

British journal of neurosurgery·2007
Same author

Assessment of middle ear disease in the dog: a comparison of diagnostic imaging modalities.

The Journal of small animal practice·2007
Same author

Desensitized renal transplant recipients show reduced cellular responses to in vitro challenge.

Transplantation proceedings·2006
Same author

Succinate dehydrogenase and fumarate hydratase: linking mitochondrial dysfunction and cancer.

Oncogene·2006
Same author

The DNA sequence and biological annotation of human chromosome 1.

Nature·2006
Same author

Entamoeba histolytica: comparison of the role of receptors and filamentous actin among various endocytic processes.

Experimental parasitology·2006
Same journal

Broad-Spectrum Antibiotics are Associated with worse Survival After Radical Treatment for Glioblastoma Multiforme: A Multicentre Study.

Clinical oncology (Royal College of Radiologists (Great Britain))·2026
Same journal

Exploring the Need for Plan Adaptation in Robustly Optimised Head and Neck Intensity Modulated Proton Therapy: A Dosimetric Perspective From Initial Institutional Experience.

Clinical oncology (Royal College of Radiologists (Great Britain))·2026
Same journal

Lifting the Lid on Best Practice in a Case of Oligometastatic Breast Cancer.

Clinical oncology (Royal College of Radiologists (Great Britain))·2026
Same journal

Proton Beam Therapy for Intracranial Germ Cell Tumors in Pediatric, Adolescent, and Young Adult Patients: A Single-Center Experience.

Clinical oncology (Royal College of Radiologists (Great Britain))·2026
Same journal

Patient Perspectives on Reirradiation for Recurrent Cancer, to Inform Future Clinical Trial Design.

Clinical oncology (Royal College of Radiologists (Great Britain))·2026
Same journal

Clinical Oncologist and NHS Commissioner Perspectives on Reirradiation for Recurrent Cancer.

Clinical oncology (Royal College of Radiologists (Great Britain))·2026
See all related articles

Related Experiment Video

Updated: May 16, 2025

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma
04:09

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

8.2K

Natural Language Processing to Extract Head and Neck Cancer Data From Unstructured Electronic Health Records.

T Young1, J Au Yeung1, K Sambasivan1

  • 1Guy's and St Thomas' NHS Foundation Trust (GSTT), UK; King's College London, UK.

Clinical Oncology (Royal College of Radiologists (Great Britain))
|April 6, 2025
PubMed
Summary
This summary is machine-generated.

Artificial intelligence using Natural Language Processing (NLP) can efficiently extract valuable Head and Neck Cancer (HNC) data from unstructured electronic health records. Limited training improved data extraction accuracy, demonstrating the potential of AI in curating real-world clinical datasets.

Keywords:
Artificial intelligencedata miningnatural language processingreal-world data

More Related Videos

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

15.7K
TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients
09:00

TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients

Published on: April 13, 2021

4.4K

Related Experiment Videos

Last Updated: May 16, 2025

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma
04:09

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

8.2K
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

15.7K
TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients
09:00

TBase - an Integrated Electronic Health Record and Research Database for Kidney Transplant Recipients

Published on: April 13, 2021

4.4K

Area of Science:

  • Medical Informatics
  • Artificial Intelligence in Healthcare
  • Oncology Data Management

Background:

  • Electronic Health Records (EHRs) often contain unstructured patient data requiring manual curation.
  • Natural Language Processing (NLP) offers a potential solution for rapidly extracting and structuring this data.
  • Head and Neck Cancer (HNC) research can benefit from enriched datasets derived from EHRs.

Purpose of the Study:

  • To evaluate the effectiveness of an open-source healthcare NLP tool (CogStack) for extracting Head and Neck Cancer (HNC) patient data from unstructured EHRs.
  • To assess the performance of the NLP tool after limited supervised training cycles.
  • To determine if a thresholding strategy can improve data extraction precision.

Main Methods:

  • CogStack was employed to extract SNOMED-CT concepts from HNC patient documents.
  • Initial performance was assessed against manually curated ground truth data.
  • The model underwent two cycles of supervised training using annotated clinical documents.
  • A thresholding approach was implemented to enhance precision, and the final model was evaluated on an unseen test cohort.
  • F1 score was utilized as the primary evaluation metric.

Main Results:

  • Pre-training, F1 scores were incalculable for 19.5% of concepts due to low recall.
  • After one training cycle, F1 scores became calculable for all concepts (median 0.692).
  • The final model achieved a median F1 score of 0.708 after further training.
  • The test cohort yielded a median F1 score of 0.750, improving to 0.778 with concept-specific thresholding.
  • 50 out of 109 SNOMED-CT concepts met predefined criteria for adequate fine-tuning.

Conclusions:

  • NLP tools like CogStack can effectively mine unstructured cancer data with limited training.
  • Model performance generalized well to an unseen test cohort, indicating robustness.
  • A concept-specific thresholding strategy significantly improved extraction performance.
  • While generally effective, certain concepts like histopathology terms remained challenging to retrieve accurately.
  • The validated NLP approach was successfully applied to extract data for 50 concepts across the entire retrospective HNC cohort.