Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A comparison of Fast Healthcare Interoperability Resources and Observational Medical Mutcomes Partnership electronic health record data within the All of Us Research Program.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

Heterogeneity of Treatment Effects Across Nine Glucose-Lowering Drug Classes in Type 2 Diabetes: Extension of the LEGEND-T2DM Network Study.

Diabetes, obesity & metabolism·2026
Same author

Semaglutide and Neovascular Age-Related Macular Degeneration Among Adults with Type 2 Diabetes: An OHDSI Network Study.

Ophthalmology·2026
Same author

Return of results is important to heterogeneous research participants: A single-site survey.

Journal of clinical and translational science·2026
Same author

Target Trial Emulation of Vaccine Effectiveness in 5- to 17-years-olds with Prior SARS-CoV-2 Infection.

Nature communications·2026
Same author

Comparative Cardiovascular Effectiveness of Glucagon-Like Peptide 1 Receptor Agonists and Sodium-Glucose Cotransporter 2 Inhibitors in Diabetes Mellitus.

Journal of the American College of Cardiology·2026
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

Related Experiment Video

Updated: Jun 14, 2026

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing
06:51

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing

Published on: June 6, 2025

Using a pipeline to improve de-identification performance.

Frances P Morrison1, Soumitra Sengupta, George Hripcsak

  • 1Columbia University Department of Biomedical Informatics.

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|March 31, 2010
PubMed
Summary
This summary is machine-generated.

Pipelining two text-processing systems, deid.pl and MedLEE, effectively de-identified clinical notes, retaining only 2% of protected health information (PHI). This strategy enhances electronic health record data reuse for research.

More Related Videos

A Versatile Pipeline for Analyzing Dynamic Changes in Nuclear Bodies in a Variety of Cell Types
06:33

A Versatile Pipeline for Analyzing Dynamic Changes in Nuclear Bodies in a Variety of Cell Types

Published on: June 28, 2024

Related Experiment Videos

Last Updated: Jun 14, 2026

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing
06:51

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing

Published on: June 6, 2025

A Versatile Pipeline for Analyzing Dynamic Changes in Nuclear Bodies in a Variety of Cell Types
06:33

A Versatile Pipeline for Analyzing Dynamic Changes in Nuclear Bodies in a Variety of Cell Types

Published on: June 28, 2024

Area of Science:

  • Medical Informatics
  • Natural Language Processing
  • Health Data Security

Background:

  • Electronic health records (EHRs) contain valuable data for research.
  • De-identification is crucial for protecting patient privacy and enabling data reuse.
  • Existing de-identification methods require optimization for accuracy and efficiency.

Purpose of the Study:

  • To evaluate a novel de-identification strategy using a pipeline of two text-processing systems.
  • To assess the effectiveness of combining deid.pl and MedLEE for removing protected health information (PHI) from clinical notes.
  • To determine the error rate and nature of retained PHI in de-identified clinical notes.

Main Methods:

  • A pipeline approach was implemented, processing 100 outpatient clinical notes sequentially through deid.pl (PhysioToolkit) and then MedLEE.
  • Manual comparison of the de-identified output against original notes was performed to identify retained PHI.
  • Error rates and types of retained personal health information were analyzed.

Main Results:

  • The pipelined system achieved a low overall error rate of 2% in de-identifying clinical notes.
  • Only two personal names were retained in the output, one initial and a common medical term.
  • All retained PHI was successfully transformed into standardized medical concepts, reducing re-identification risk.

Conclusions:

  • Pipelining deid.pl with MedLEE significantly improved the exclusion of PHI from de-identified clinical notes.
  • This tandem text-processing strategy offers a promising approach for de-identifying clinical data.
  • The method provides computer-readable output, facilitating the reuse of electronic health record data for research purposes.