Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 10, 2026

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

Information Extraction for Clinical Data Mining: A Mammography Case Study.

Houssam Nassif1, Ryan Woods, Elizabeth Burnside

  • 1Department of Computer Sciences, University of Wisconsin-Madison, USA ; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, USA.

Proceedings. IEEE International Conference on Data Mining
|June 15, 2013
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Age, morphology, and environmental variation shape movement behaviour syndromes in a riverine fish - golden perch (Macquaria ambigua).

Movement ecology·2026
Same author

Rocuronium Dose and First-Attempt Intubation Success in the Critically Ill: Secondary Analysis of Two Multicenter Trials.

American journal of respiratory and critical care medicine·2026
Same author

United Global Advocacy Drives Updates to World Health Organization Essential Medicines List.

Haemophilia : the official journal of the World Federation of Hemophilia·2026
Same author

Impact of measurable residual disease on outcomes using a modified DFCI protocol for adults with BCR-ABL negative acute lymphoblastic leukemia.

Leukemia research·2026
Same author

Stratifying Risk and Treatment Benefit: A Model Predicting Overall Survival in Men with Metastatic De Novo Hormone-sensitive Prostate Cancer in Trials Investigating Docetaxel (the STOPCAP Collaboration).

European urology focus·2026
Same author

Proposed Modifications to Prognostic Classification of AML Patients Treated With Intensive Chemotherapy Based on Recent Real-World Data.

Clinical lymphoma, myeloma & leukemia·2025
Same journal

GeoToken: Hierarchical Geolocalization of Images via Next Token Prediction.

Proceedings. IEEE International Conference on Data Mining·2026
Same journal

A Universal Metric of Dataset Similarity for Cross-silo Federated Learning.

Proceedings. IEEE International Conference on Data Mining·2026
Same journal

Heterogeneous Treatment Effect Estimation with Subpopulation Identification for Personalized Medicine in Opioid Use Disorder.

Proceedings. IEEE International Conference on Data Mining·2024
Same journal

Enhancing Personalized Healthcare via Capturing Disease Severity, Interaction, and Progression.

Proceedings. IEEE International Conference on Data Mining·2024
Same journal

Robust Unsupervised Domain Adaptation from A Corrupted Source.

Proceedings. IEEE International Conference on Data Mining·2023
Same journal

RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging.

Proceedings. IEEE International Conference on Data Mining·2023
See all related articles

This study introduces an algorithm to extract Breast Imaging Reporting and Data System (BI-RADS) features from mammography reports, improving data quality for machine learning models in breast cancer detection.

Area of Science:

  • Medical Informatics
  • Radiology
  • Data Mining

Background:

  • Breast cancer is a leading cause of mortality in women, with early detection crucial for survival.
  • Machine learning models for breast cancer detection rely on high-quality mammography data.
  • Existing databases often have data inconsistencies, missing information, and non-standard formats, hindering model accuracy.

Purpose of the Study:

  • To develop and validate an automated algorithm for extracting BI-RADS features from free-text mammography reports.
  • To address data quality issues by standardizing feature extraction and enabling consistency checks.
  • To improve the reliability of data used for training and validating breast cancer detection models.

Main Methods:

  • Developed a BI-RADS feature extraction algorithm comprising a syntax analyzer, concept finder, and negation detector.
Keywords:
BI-RADSclinical data miningfree textlexiconmammography

More Related Videos

Clinical Imaging of Microwave Mammography
05:28

Clinical Imaging of Microwave Mammography

Published on: November 14, 2025

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

Related Experiment Videos

Last Updated: May 10, 2026

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

Clinical Imaging of Microwave Mammography
05:28

Clinical Imaging of Microwave Mammography

Published on: November 14, 2025

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

  • Utilized a semantic grammar based on the BI-RADS lexicon and expert input for concept identification.
  • Implemented a lexical scanner to detect negation within the text, handling multiple concepts and filtering irrelevant information.
  • Main Results:

    • The algorithm achieved high performance metrics on a test dataset: 97.7% precision, 95.5% recall, and an F1-score of 0.97.
    • Demonstrated superior performance compared to manual feature extraction at a 5% statistical significance level.
    • Successfully extracted BI-RADS concepts from free text, enhancing data standardization for clinical data mining.

    Conclusions:

    • The proposed algorithm effectively extracts BI-RADS features from free-text mammography reports, significantly improving data quality.
    • This automated approach enhances the reliability and consistency of data for machine learning applications in breast cancer.
    • The method offers a robust solution for overcoming data limitations in clinical databases, supporting more accurate malignancy detection.