Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

188
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
188
Classification of Illness01:17

Classification of Illness

7.9K
The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe...
7.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The clinical utility of bronchoalveolar lavage galactomannan result stewardship within a tertiary medical system.

Medical mycology·2026
Same author

Automated Prediction of Glasgow Coma Scale Scores From Unstructured Electronic Health Records Using Natural Language Processing: Development and Validation Study.

Journal of medical Internet research·2026
Same author

A myco-management problem: improving utilization of fungal and mycobacterial smear and culture.

Journal of clinical microbiology·2026
Same author

Delirium and Increased Risk of Developing Dementia: An Emulated Target Trial Analysis.

medRxiv : the preprint server for health sciences·2026
Same author

Late-Onset Seizures: Etiology and Demographics in US Tertiary Care Epilepsy Centers.

Neurology·2026
Same author

Mindful diagnostics: a central nervous system infection case study.

Antimicrobial stewardship & healthcare epidemiology : ASHE·2026
Same journal

Pregnancy-Related Clinical Codes in Unlikely Populations in Primary Care.

JMIR medical informatics·2026
Same journal

Selecting, Scaling, and Measuring the Value of Ambient AI in a Nonacademic Health System: Multiphase Pilot Study.

JMIR medical informatics·2026
Same journal

Prediction of Early Hospital Admission (≤24 Hours) After Stroke Using Machine Learning and Deep Learning: Multicenter Study From China.

JMIR medical informatics·2026
Same journal

Assessing the Feasibility and Acceptability of Implementing a Preclinic Vital Signs Assessment in Primary Care: Cross-Sectional Pilot Study.

JMIR medical informatics·2026
Same journal

Candidate Passive Sensor Suite Technologies for Tactical Combat Casualty Care Environments: Comparative Assessment Study.

JMIR medical informatics·2026
Same journal

Relevance of the uMap Collaborative Platform as Support for Choropleth Mapping: A Traffic‒Light Statistical Signal Atlas of All-Cause Mortality-First French Lockdown.

JMIR medical informatics·2026
See all related articles

Related Experiment Video

Updated: Sep 9, 2025

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.6K

A Machine Learning Approach for Identifying People With Neuroinfectious Diseases in Electronic Health Records:

Arjun Singh1,2, Shadi Sartipi1,2, Haoqi Sun2

  • 1Department of Neurology, Massachusetts General Hospital, 55 Fruit St, Wang ACC 835, Boston, MA, 02114, United States, 1 2163379887.

JMIR Medical Informatics
|August 29, 2025
PubMed
Summary
This summary is machine-generated.

A new natural language processing (NLP) model accurately identifies neuroinfectious diseases (NID) from clinical notes, outperforming traditional billing codes and other AI models. This machine learning approach offers a reliable tool for NID research and patient cohort identification.

Keywords:
EHRartificial intelligenceclinical notesdevelopefficiencyelectronic health recordsexpressionslogistic regressionmachine learningneuroinfectiousneuroinfectious diseaseneurologyvalidate

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K

Related Experiment Videos

Last Updated: Sep 9, 2025

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images
08:20

Author Spotlight: AI-Driven Trypanosome Species Detection from Microscopic Images

Published on: October 27, 2023

1.6K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K

Area of Science:

  • Computational medicine
  • Medical informatics
  • Natural Language Processing (NLP)

Background:

  • Identifying neuroinfectious diseases (NID) using billing codes is imprecise, and manual chart reviews are labor-intensive.
  • Machine learning (ML) can analyze unstructured electronic health records (EHRs) for subtle NID indicators, improving efficiency and reducing misclassification.
  • Accurate NID classification is crucial for research and clinical decision support, yet using unstructured notes remains underexplored.

Purpose of the Study:

  • To develop and validate an ML model for identifying NIDs from unstructured patient notes.
  • To compare the NLP model's performance against ICD billing codes and a large language model (LLM).
  • To assess the model's generalizability across different healthcare institutions.

Main Methods:

  • An extreme gradient boosting (XGBoost) model was trained on 3000 clinical notes from Mass General Brigham (MGB).
  • Notes were processed using n-gram representations (n=1, 2, 3), with feature selection via L1 regularization.
  • Performance was evaluated using AUROC and AUPRC, with external validation on data from Beth Israel Deaconess Medical Center (BIDMC).

Main Results:

  • The NLP model achieved an AUROC of 0.98 and AUPRC of 0.89 on MGB test data, balancing specificity (0.96) and sensitivity (0.84).
  • ICD billing codes showed high sensitivity (0.97) but poor specificity (0.59), while Llama 3.2 had improved specificity (0.94) but low sensitivity (0.64).
  • The NLP model maintained strong performance on external BIDMC data (AUROC 0.98, AUPRC 0.78).

Conclusions:

  • The NLP model accurately identifies NID cases from clinical notes, demonstrating feasibility for large-scale NID research.
  • The model's performance across two independent hospital datasets highlights its potential for cohort generation.
  • Further external validation is recommended to enhance the generalizability of these findings to other institutions.