Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Enhancing Query Formulation for Universal Image Segmentation.

Sensors (Basel, Switzerland)·2024

Same author

Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection.

Sensors (Basel, Switzerland)·2023

Same author

Enhancing Mask Transformer with Auxiliary Convolution Layers for Semantic Segmentation.

Sensors (Basel, Switzerland)·2023

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 31, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Efficient Multi-Task Training with Adaptive Feature Alignment for Universal Image Segmentation.

Yipeng Qu¹, Joohee Kim¹

¹Department of Electrical and Computer Engineering, Illinois Institute of Technology, Chicago, IL 60616, USA.

Sensors (Basel, Switzerland)

|January 25, 2025

Summary

This summary is machine-generated.

This study introduces a novel Adaptive Feature Alignment (AFA) method with a learnable task token for universal image segmentation. This approach effectively captures task differences, improving model efficiency and performance over text-based tokens.

Keywords:

computer vision feature alignment multimodal learning universal image segmentation

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

Related Experiment Videos

Last Updated: May 31, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Universal image segmentation models aim for single architecture, multi-task training.
Current methods use text-based task tokens, which lack inherent task differentiation and cause modality discrepancies.
Existing alignment methods are computationally expensive and unsuitable for resource-constrained devices.

Purpose of the Study:

To propose an Adaptive Feature Alignment (AFA) method with a learnable task token for universal image segmentation.
To address limitations of text-based task tokens and complex modality alignment methods.
To enhance efficiency and effectiveness in lightweight segmentation models.

Main Methods:

Introduced a learnable task token that automatically captures inter-task differences from image features and text queries.
Developed Adaptive Feature Alignment (AFA) by replacing image features with class-specific means for efficient cross-modal alignment.
Integrated the learnable task token with AFA for unified multi-task training.

Main Results:

The proposed model with AFA and learnable task token demonstrated superior efficiency and effectiveness compared to baseline models.
Achieved state-of-the-art performance on ADE20K and Cityscapes datasets with comparable parameter counts.
Showcased the advantage of learnable task tokens over predefined text-based tokens in capturing task nuances.

Conclusions:

The AFA method with a learnable task token offers a more effective and efficient solution for universal image segmentation.
This approach overcomes the limitations of text-based conditioning and complex alignment strategies.
Enables high-performance segmentation on resource-constrained devices.