Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

1.2K
Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
1.2K
Issues And Trends In Healthcare Delivery System01:29

Issues And Trends In Healthcare Delivery System

6.0K
The issues and trends in healthcare delivery are constantly changing. The COVID-19 pandemic is one recent issue that wreaked havoc on healthcare systems, causing a shortage of healthcare workers, high demand for medicines and supplies, and increased medical expenditure due to a lack of insurance. Other issues include rising healthcare costs and care fragmentation.
Cost Containment
Payment for healthcare services has historically promoted adoption of costly and often unnecessary or inefficient...
6.0K
Methods of Documentation VII: EMR01:30

Methods of Documentation VII: EMR

1.3K
Electronic Medical Records (EMRs) primarily center around electronically documenting patients' health information within a single healthcare organization or practice. They contain essential clinical data related to a patient's medical history, diagnoses, medications, treatment plans, lab results, and other pertinent information relevant to the specific encounter or episode of care. EMRs are designed to streamline documentation and workflow processes within individual healthcare...
1.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Social Determinants of Health and Continuous Glucose Monitoring Metrics in Type 1 or Type 2 Diabetes.

JAMA network open·2026
Same author

Association of White Blood Cell Count with Treatment Response to Cefepime vs Piperacillin-Tazobactam.

American journal of respiratory and critical care medicine·2026
Same author

Continuing Vs. Withholding Home Beta-Blockers at Admission for Suspected Infection: A Target Trial Emulation.

Critical care medicine·2026
Same author

Mind the Performance Gap: Examining Dataset Shift During Prospective Validation.

Proceedings of machine learning research·2026
Same author

Prone Positioning in a North American Cohort of Hypoxemic Patients on Mechanical Ventilation.

Critical care medicine·2026
Same author

Reply to Hamilton et al. and chen et al.

American journal of respiratory and critical care medicine·2026
Same journal

Extending the fundamental theorem of biomedical informatics: a proposal and illustrative examples.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Human factors methods for designing safe health information technology: what do the experts think?

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Equity-by-design for socially assistive robots as digital health tools.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Orchestrator multi-agent clinical decision support system for secondary headache diagnosis in primary care.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

CUI-Curate: a GraphRAG-based framework for automated clinical concept curation for NLP applications.

Journal of the American Medical Informatics Association : JAMIA·2026
Same journal

Malfunctions in distributed clinical decision support: 3 cases from a multi‑component clinical decision support system.

Journal of the American Medical Informatics Association : JAMIA·2026
See all related articles

Related Experiment Video

Updated: Dec 6, 2025

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing
06:51

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing

Published on: June 6, 2025

728

Democratizing EHR analyses with FIDDLE: a flexible data-driven preprocessing pipeline for structured clinical data.

Shengpu Tang1, Parmida Davarmanesh2, Yanmeng Song3

  • 1Department of Electrical Engineering and Computer Science, Division of Computer Science and Engineering, University of Michigan, Ann Arbor, USA.

Journal of the American Medical Informatics Association : JAMIA
|October 11, 2020
PubMed
Summary
This summary is machine-generated.

FIDDLE is an open-source framework that streamlines machine learning preprocessing for electronic health record data. This flexible pipeline accelerates feature extraction and model building, aiding the development of clinical ML tools.

Keywords:
electronic health recordsmachine learningpreprocessing pipeline

More Related Videos

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.4K
Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance
04:58

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance

Published on: December 13, 2024

3.7K

Related Experiment Videos

Last Updated: Dec 6, 2025

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing
06:51

PIPEMAT-RS: Development and Validation of a Standardized MATLAB Pipeline for Resting-State EEG Preprocessing

Published on: June 6, 2025

728
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack
07:31

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

7.4K
Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance
04:58

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance

Published on: December 13, 2024

3.7K

Area of Science:

  • Health Informatics
  • Machine Learning in Healthcare
  • Data Preprocessing

Background:

  • Machine learning (ML) applied to electronic health record (EHR) data requires extensive, labor-intensive preprocessing.
  • There is a growing need for systematic and reproducible EHR data preprocessing techniques.
  • The development of effective ML tools for healthcare is hindered by preprocessing challenges.

Purpose of the Study:

  • To introduce FIDDLE (Flexible Data-Driven Pipeline), an open-source framework designed to streamline EHR data preprocessing.
  • To demonstrate the utility and flexibility of FIDDLE in preparing EHR data for ML applications.
  • To reduce the manual effort and decision-making involved in EHR data preprocessing.

Main Methods:

  • FIDDLE systematically transforms structured EHR data into feature vectors in a largely data-driven manner.
  • The framework was applied to two public EHR datasets (MIMIC-III and eICU) from intensive care units.
  • ML models were trained to predict in-hospital mortality, acute respiratory failure, and shock, with performance evaluated using AUROC.

Main Results:

  • FIDDLE extracted a significant number of features (2,528–7,403) from the EHR datasets.
  • Models utilizing FIDDLE achieved strong discriminative performance (AUROCs of 0.757–0.886), comparable to existing methods.
  • FIDDLE demonstrated generalizability across different prediction times, ML algorithms, datasets, and robustness to user-defined arguments.

Conclusions:

  • FIDDLE is an open-source pipeline that simplifies the application of ML to structured EHR data.
  • By standardizing and accelerating preprocessing, FIDDLE can advance the creation of clinically relevant ML tools.
  • The framework facilitates progress in leveraging EHR data for improved healthcare outcomes.