Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Understanding older adults' perception, acceptance, and adoption of smart home technologies.

PloS one·2026
Same author

BioFuse: an embedding fusion framework for biomedical foundation models.

PloS one·2026
Same author

Exploring techniques to distinguish between real images and those generated using stable diffusion XL.

PloS one·2026
Same author

Endometrial whole-slide images dataset for detection of malignancy in endometrial biopsies.

GigaScience·2025
Same author

Cervical whole-slide images dataset for multiclass classification.

GigaScience·2025
Same author

SurGen: 1020 H&E-stained whole-slide images with survival and genetic markers.

GigaScience·2025
Same journal

Time-series prediction of adverse birth outcomes in the U.S. using multilayer perceptron neural networks.

PLOS digital health·2026
Same journal

Exploring attitudes and acceptance of artificial intelligence in multiple sclerosis from the patient perspective.

PLOS digital health·2026
Same journal

A qualitative exploration of the low-resource tele-assisted home exercise program for balance and functional mobility in Parkinson's disease (TELEPORT-PD).

PLOS digital health·2026
Same journal

A single-camera video-based assessment of locomotive syndrome using pose-silhouette fusion model.

PLOS digital health·2026
Same journal

The organization of virtual care centers: A qualitative study in Dutch hospitals.

PLOS digital health·2026
Same journal

A conceptual agentic AI architecture for MASLD-associated significant fibrosis in primary care.

PLOS digital health·2026
See all related articles

Related Experiment Video

Updated: Jul 8, 2025

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities
07:13

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities

Published on: October 27, 2023

1.2K

Classification of hyper-scale multimodal imaging datasets.

Craig Macfadyen1, Ajay Duraiswamy1, David Harris-Birtill1

  • 1University of St Andrews, St Andrews, United Kingdom.

PLOS Digital Health
|December 13, 2023
PubMed
Summary
This summary is machine-generated.

Deep learning models can accurately classify millions of medical images by modality (CT, MRI, PET, X-ray). This accelerates retrieval of diagnostic imaging data, improving clinical outcomes with 96% accuracy.

More Related Videos

Multimodal Hierarchical Imaging of Serial Sections for Finding Specific Cellular Targets within Large Volumes
11:19

Multimodal Hierarchical Imaging of Serial Sections for Finding Specific Cellular Targets within Large Volumes

Published on: March 20, 2018

10.5K
Multimodal Optical Imaging Platform for Studying Cellular Metabolism
04:47

Multimodal Optical Imaging Platform for Studying Cellular Metabolism

Published on: June 6, 2025

332

Related Experiment Videos

Last Updated: Jul 8, 2025

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities
07:13

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities

Published on: October 27, 2023

1.2K
Multimodal Hierarchical Imaging of Serial Sections for Finding Specific Cellular Targets within Large Volumes
11:19

Multimodal Hierarchical Imaging of Serial Sections for Finding Specific Cellular Targets within Large Volumes

Published on: March 20, 2018

10.5K
Multimodal Optical Imaging Platform for Studying Cellular Metabolism
04:47

Multimodal Optical Imaging Platform for Studying Cellular Metabolism

Published on: June 6, 2025

332

Area of Science:

  • Artificial Intelligence
  • Medical Imaging
  • Computer Science

Background:

  • Hyper-scale multi-modal datasets, containing millions of images, require efficient classification methods for diagnostic imaging data retrieval.
  • Accurate modality classification of medical images is crucial for accelerating clinical outcomes and research.

Purpose of the Study:

  • To demonstrate the efficacy of deep neural networks trained on a hyper-scale, multi-modal dataset for accurate medical image modality classification.
  • To evaluate the performance of different Convolutional Neural Network (CNN) architectures (ResNet-50, ResNet-18, VGG16) in classifying medical imaging modalities.

Main Methods:

  • A dataset of 4.5 million heterogeneous medical images was created by combining 102 diverse datasets.
  • ResNet-50, ResNet-18, and VGG16 models were trained to classify images into four modalities: Computed Tomography (CT), Magnetic Resonance Imaging (MRI), Positron Emission Tomography (PET), and X-ray.
  • Model performance was assessed on unseen data, measuring classification accuracy and balanced accuracy.

Main Results:

  • The best performing model achieved a 96% classification accuracy on unseen data, comparable to or exceeding more complex models like EfficientNets and Vision Transformers (ViTs).
  • A balanced accuracy of 86% was achieved by the top model.
  • The study confirmed the feasibility of training deep learning CNNs on hyper-scale multimodal datasets.

Conclusions:

  • Deep learning models trained on hyper-scale multimodal datasets can achieve high accuracy in classifying medical imaging modalities.
  • These models have significant potential for real-world applications involving large-scale medical image repositories and national healthcare institutions.
  • Future research could extend this classification capability to include 3D scans.