Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Artificial intelligence for detecting fetal orofacial clefts and advancing medical education.

Nature communications·2026
Same author

Multi-omic analysis of deep learning-derived phenotypes links ophthalmic imaging to cardiovascular and neurological traits.

Nature cardiovascular research·2026
Same author

From pixels to polygons: A survey of deep learning approaches for medical image-to-mesh reconstruction.

Medical image analysis·2026
Same author

Cumulative burden of maternal vascular malperfusion and its association with early cerebral oxygenation in neonates.

Frontiers in cell and developmental biology·2026
Same author

OnUVS: An Online Motion Transfer Framework with Content-Texture Decoupling for High-Fidelity Ultrasound Video Synthesis.

IEEE journal of biomedical and health informatics·2026
Same author

Development of a core outcome set for intraventricular haemorrhage (IVH) in preterm infants: a study protocol.

BMJ open·2026
Same journal

A Multi-Head Attention Transformer Model for Wearable in Situ Fall Detection.

IEEE access : practical innovations, open solutions·2026
Same journal

Validating Single-Camera Pose Estimation Against Multi-Camera Motion Capture for Accessible Biomechanical Assessment.

IEEE access : practical innovations, open solutions·2026
Same journal

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification.

IEEE access : practical innovations, open solutions·2026
Same journal

Radio-Frequency Toroid Susceptometry of Magnetic Nanoparticles: What Goes Around Comes Around.

IEEE access : practical innovations, open solutions·2026
Same journal

Cross-Architecture Knowledge Distillation for Histopathological Image Analysis.

IEEE access : practical innovations, open solutions·2026
Same journal

Mislabel Identification Using Transfer Learning-Based Ensemble Method.

IEEE access : practical innovations, open solutions·2026
See all related articles

Related Experiment Video

Updated: Sep 21, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

530

Medical Image Segmentation Using Transformer Networks.

Davood Karimi1, Haoran Dou2, Ali Gholipour1

  • 1Department of Radiology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.

IEEE Access : Practical Innovations, Open Solutions
|June 3, 2022
PubMed
Summary
This summary is machine-generated.

A novel deep neural network architecture, based on self-attention rather than convolutions, achieves superior medical image segmentation accuracy. This transformer-based model excels even with limited labeled data, outperforming traditional fully-convolutional networks (FCNs).

Keywords:
Deep learningmedical image segmentationself-attentiontransformer networks

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K

Related Experiment Videos

Last Updated: Sep 21, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

530
A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K

Area of Science:

  • Artificial Intelligence
  • Medical Imaging
  • Computer Vision

Background:

  • Deep learning, particularly fully-convolutional networks (FCNs), dominates medical image segmentation.
  • FCNs utilize convolution operations, offering benefits like parameter sharing but struggling with long-range interactions and content-agnostic processing.
  • Limitations of FCNs include fixed operations and difficulty modeling long-range dependencies in medical images.

Purpose of the Study:

  • To introduce a novel deep neural network architecture for medical image segmentation that overcomes FCN limitations.
  • To demonstrate the efficacy of a transformer-based, self-attention mechanism for segmentation tasks.
  • To address the challenge of limited labeled medical data through effective pre-training strategies.

Main Methods:

  • A transformer network architecture is proposed, processing 3D image blocks by dividing them into patches.
  • Self-attention mechanisms are computed between 1D embeddings of neighboring image patches for segmentation prediction.
  • Methods for pre-training the model on large unlabeled medical image datasets are developed to mitigate data scarcity.

Main Results:

  • The proposed self-attention-based model achieved higher segmentation accuracies than state-of-the-art FCNs on two benchmark datasets.
  • The network demonstrated strong performance when trained with very few labeled images (tens).
  • Pre-training strategies significantly enhanced the model's performance, especially in low-data regimes, outperforming FCNs.

Conclusions:

  • A novel, convolution-free, transformer-based architecture offers a more accurate approach to medical image segmentation.
  • The model's reliance on self-attention effectively captures necessary image context, surpassing FCN capabilities.
  • The proposed pre-training methods make the model highly effective even with scarce labeled medical imaging data.