Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A data-centric perspective on designing AI foundation models for healthcare.

Frontiers in digital health·2026
Same author

Multimodal Deep Learning for Stroke Prediction and Detection using Retinal Imaging and Clinical Data.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Meta-Repository of Screening Mammography Classifiers.

Studies in health technology and informatics·2025
Same author

Multimodal Machine Learning for Stroke Prognosis and Diagnosis: A Systematic Review.

IEEE journal of biomedical and health informatics·2024
Same author

Machine Learning Models Versus the National Early Warning Score System for Predicting Deterioration: Retrospective Cohort Study in the United Arab Emirates.

JMIR AI·2024
Same author

Development and validation of a parsimonious prediction model for positive urine cultures in outpatient visits.

PLOS digital health·2023
Same journal

Correction: A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms.

Scientific reports·2026
Same journal

Poly(bromophenol blue)/CoSn(OH)<sub>6</sub> cubic particles modified pencil graphite electrode for electrochemical determination of diphenhydramine.

Scientific reports·2026
Same journal

Dietary Chlorella, Spirulina, and acidifier modulate jejunal cytokine-related gene expression in broiler chickens.

Scientific reports·2026
Same journal

Perceived physical activity barriers in university students: associations with fatigue and eating behaviours.

Scientific reports·2026
Same journal

Refuge limitation structures habitat use in agricultural landscapes: evidence from Sunda pangolins.

Scientific reports·2026
Same journal

Lightweight stateless transaction verification with outsourced witness updates for UTXO blockchains.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jun 11, 2025

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.7K

Multimodal masked siamese network improves chest X-ray representation learning.

Saeed Shurrab1, Alejandro Guerra-Manzanares1, Farah E Shamout2

  • 1New York University Abu Dhabi, Computer Engineering, Abu Dhabi, 129188, UAE.

Scientific Reports
|September 28, 2024
PubMed
Summary
This summary is machine-generated.

This study enhances self-supervised learning for chest X-rays by integrating Electronic Health Records (EHR) data. This multimodal approach significantly improves image representation quality and diagnostic performance.

More Related Videos

Multi-modal Pulmonary Imaging: Using Complementary Information from CT and Hyperpolarized 129Xe MRI to Evaluate Lung Structure-Function
02:09

Multi-modal Pulmonary Imaging: Using Complementary Information from CT and Hyperpolarized 129Xe MRI to Evaluate Lung Structure-Function

Published on: April 12, 2024

541
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

376

Related Experiment Videos

Last Updated: Jun 11, 2025

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.7K
Multi-modal Pulmonary Imaging: Using Complementary Information from CT and Hyperpolarized 129Xe MRI to Evaluate Lung Structure-Function
02:09

Multi-modal Pulmonary Imaging: Using Complementary Information from CT and Hyperpolarized 129Xe MRI to Evaluate Lung Structure-Function

Published on: April 12, 2024

541
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

376

Area of Science:

  • Artificial Intelligence in Medical Imaging
  • Machine Learning for Healthcare
  • Radiology Informatics

Background:

  • Current self-supervised learning for medical images often overlooks valuable Electronic Health Records (EHR) data.
  • Integrating diverse patient and scan information can potentially improve model performance.

Purpose of the Study:

  • To develop a multimodal pretraining strategy for chest radiographs incorporating EHR data.
  • To enhance the quality of chest radiograph representations using a Masked Siamese Network (MSN).

Main Methods:

  • Proposed a multimodal Masked Siamese Network (MSN) integrating EHR data (demographics, scan metadata, inpatient stay) with chest radiographs.
  • Evaluated the approach on MIMIC-CXR, CheXpert, and NIH-14 datasets using ViT-Tiny and ViT-Small backbones.
  • Assessed representation quality via linear evaluation and Area Under the Receiver Operating Characteristic Curve (AUROC).

Main Results:

  • The multimodal MSN significantly improved representation quality over vanilla MSN and state-of-the-art baselines.
  • Achieved 2% AUROC improvement over vanilla MSN and 5-8% over other baselines.
  • Demographic features demonstrated the most substantial performance enhancement.

Conclusions:

  • EHR-enhanced self-supervised pretraining offers a promising avenue for improving medical imaging analysis.
  • This multimodal strategy can advance representation learning for various medical imaging modalities.
  • Future research can explore this approach for neuroimaging, ophthalmic imaging, and sonar imaging.