Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Enhancing Query Formulation for Universal Image Segmentation.

Sensors (Basel, Switzerland)·2024
Same author

Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection.

Sensors (Basel, Switzerland)·2023
Same author

Enhancing Mask Transformer with Auxiliary Convolution Layers for Semantic Segmentation.

Sensors (Basel, Switzerland)·2023
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: May 31, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

363

Efficient Multi-Task Training with Adaptive Feature Alignment for Universal Image Segmentation.

Yipeng Qu1, Joohee Kim1

  • 1Department of Electrical and Computer Engineering, Illinois Institute of Technology, Chicago, IL 60616, USA.

Sensors (Basel, Switzerland)
|January 25, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Adaptive Feature Alignment (AFA) method with a learnable task token for universal image segmentation. This approach effectively captures task differences, improving model efficiency and performance over text-based tokens.

Keywords:
computer visionfeature alignmentmultimodal learninguniversal image segmentation

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.6K
Deep Learning-Based Segmentation of Cryo-Electron Tomograms
10:25

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

8.5K

Related Experiment Videos

Last Updated: May 31, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

363
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.6K
Deep Learning-Based Segmentation of Cryo-Electron Tomograms
10:25

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

8.5K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Universal image segmentation models aim for single architecture, multi-task training.
  • Current methods use text-based task tokens, which lack inherent task differentiation and cause modality discrepancies.
  • Existing alignment methods are computationally expensive and unsuitable for resource-constrained devices.

Purpose of the Study:

  • To propose an Adaptive Feature Alignment (AFA) method with a learnable task token for universal image segmentation.
  • To address limitations of text-based task tokens and complex modality alignment methods.
  • To enhance efficiency and effectiveness in lightweight segmentation models.

Main Methods:

  • Introduced a learnable task token that automatically captures inter-task differences from image features and text queries.
  • Developed Adaptive Feature Alignment (AFA) by replacing image features with class-specific means for efficient cross-modal alignment.
  • Integrated the learnable task token with AFA for unified multi-task training.

Main Results:

  • The proposed model with AFA and learnable task token demonstrated superior efficiency and effectiveness compared to baseline models.
  • Achieved state-of-the-art performance on ADE20K and Cityscapes datasets with comparable parameter counts.
  • Showcased the advantage of learnable task tokens over predefined text-based tokens in capturing task nuances.

Conclusions:

  • The AFA method with a learnable task token offers a more effective and efficient solution for universal image segmentation.
  • This approach overcomes the limitations of text-based conditioning and complex alignment strategies.
  • Enables high-performance segmentation on resource-constrained devices.