Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Artificial intelligence for detecting fetal orofacial clefts and advancing medical education.

Nature communications·2026

Same author

Multi-omic analysis of deep learning-derived phenotypes links ophthalmic imaging to cardiovascular and neurological traits.

Nature cardiovascular research·2026

Same author

From pixels to polygons: A survey of deep learning approaches for medical image-to-mesh reconstruction.

Medical image analysis·2026

Same author

Cumulative burden of maternal vascular malperfusion and its association with early cerebral oxygenation in neonates.

Frontiers in cell and developmental biology·2026

Same author

OnUVS: An Online Motion Transfer Framework with Content-Texture Decoupling for High-Fidelity Ultrasound Video Synthesis.

IEEE journal of biomedical and health informatics·2026

Same author

Development of a core outcome set for intraventricular haemorrhage (IVH) in preterm infants: a study protocol.

BMJ open·2026

Same journal

A Multi-Head Attention Transformer Model for Wearable in Situ Fall Detection.

IEEE access : practical innovations, open solutions·2026

Same journal

Validating Single-Camera Pose Estimation Against Multi-Camera Motion Capture for Accessible Biomechanical Assessment.

IEEE access : practical innovations, open solutions·2026

Same journal

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification.

IEEE access : practical innovations, open solutions·2026

Same journal

Radio-Frequency Toroid Susceptometry of Magnetic Nanoparticles: What Goes Around Comes Around.

IEEE access : practical innovations, open solutions·2026

Same journal

Cross-Architecture Knowledge Distillation for Histopathological Image Analysis.

IEEE access : practical innovations, open solutions·2026

Same journal

Mislabel Identification Using Transfer Learning-Based Ensemble Method.

IEEE access : practical innovations, open solutions·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 21, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Medical Image Segmentation Using Transformer Networks.

Davood Karimi¹, Haoran Dou², Ali Gholipour¹

¹Department of Radiology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.

IEEE Access : Practical Innovations, Open Solutions

|June 3, 2022

Summary

This summary is machine-generated.

A novel deep neural network architecture, based on self-attention rather than convolutions, achieves superior medical image segmentation accuracy. This transformer-based model excels even with limited labeled data, outperforming traditional fully-convolutional networks (FCNs).

Keywords:

Deep learning medical image segmentation self-attention transformer networks

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Related Experiment Videos

Last Updated: Sep 21, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Area of Science:

Artificial Intelligence
Medical Imaging
Computer Vision

Background:

Deep learning, particularly fully-convolutional networks (FCNs), dominates medical image segmentation.
FCNs utilize convolution operations, offering benefits like parameter sharing but struggling with long-range interactions and content-agnostic processing.
Limitations of FCNs include fixed operations and difficulty modeling long-range dependencies in medical images.

Purpose of the Study:

To introduce a novel deep neural network architecture for medical image segmentation that overcomes FCN limitations.
To demonstrate the efficacy of a transformer-based, self-attention mechanism for segmentation tasks.
To address the challenge of limited labeled medical data through effective pre-training strategies.

Main Methods:

A transformer network architecture is proposed, processing 3D image blocks by dividing them into patches.
Self-attention mechanisms are computed between 1D embeddings of neighboring image patches for segmentation prediction.
Methods for pre-training the model on large unlabeled medical image datasets are developed to mitigate data scarcity.

Main Results:

The proposed self-attention-based model achieved higher segmentation accuracies than state-of-the-art FCNs on two benchmark datasets.
The network demonstrated strong performance when trained with very few labeled images (tens).
Pre-training strategies significantly enhanced the model's performance, especially in low-data regimes, outperforming FCNs.

Conclusions:

A novel, convolution-free, transformer-based architecture offers a more accurate approach to medical image segmentation.
The model's reliance on self-attention effectively captures necessary image context, surpassing FCN capabilities.
The proposed pre-training methods make the model highly effective even with scarce labeled medical imaging data.