Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dual contextual learning for semi-supervised medical image classification.

Frontiers in medicine·2026
Same author

Effects of Halliwick-based aquatic exercise on social and motor skills of children with autism spectrum disorder: a pilot study.

Frontiers in psychology·2026
Same author

Ubiquitin-dependent mitochondrial protein degradation ensures seedling emergence by regulating ER-mitochondrial interaction and mitophagy.

Nature plants·2026
Same author

Secondary signs combined with renal pelvis urine density on preoperative CT as a predictor of infection following ureteroscopic lithotripsy with chronic kidney disease.

Polish journal of radiology·2026
Same author

Disc collapse aggravates vertebral recollapse following percutaneous kyphoplasty: a clinical review and biomechanical study.

Quantitative imaging in medicine and surgery·2026
Same author

A unified deep learning framework for cross-platform harmonization of multi-tracer PET quantification in neurodegenerative disease.

NPJ digital medicine·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Sep 8, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

491

Deformation-Resilient Multigranularity Learning for Unaligned RGB-T Semantic Segmentation.

Heng Zhou, Zhenxi Zhang, Chengyang Li

    IEEE Transactions on Neural Networks and Learning Systems
    |July 8, 2025
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new benchmark and method for unaligned RGB-Thermal semantic segmentation, improving object mask accuracy by aligning features across modalities. The deformation-resilient multigranularity learning (DML) method effectively handles misaligned image pairs.

    More Related Videos

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    633
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.9K

    Related Experiment Videos

    Last Updated: Sep 8, 2025

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    491
    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    633
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.9K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Image Processing

    Background:

    • Semantic segmentation (SS) typically requires aligned RGB-Thermal (RGB-T) image pairs.
    • Real-world RGB-T data is often unaligned, posing significant challenges for existing SS methods.
    • Pixel-level alignment of unaligned RGB-T images is computationally intensive and difficult.

    Purpose of the Study:

    • To address the challenge of unaligned RGB-T image pairs in semantic segmentation.
    • To introduce a novel benchmark dataset for unaligned RGB-T SS.
    • To propose a deformation-resilient multigranularity learning (DML) method for robust RGB-T SS.

    Main Methods:

    • Developed a new unaligned RGB-T SS benchmark dataset.
    • Proposed the deformation-resilient multigranularity learning (DML) method.
    • Introduced a deformation-aware complementary feature enhancer (DCFE) with deformation-aware feature alignment (DFA) and complementary feature aggregation (CFA) modules.
    • Designed a multigranularity mask refinement engine (MMFE) incorporating class-agnostic saliency prediction (CSP) and class-aware edge generation (CEG).

    Main Results:

    • The DML method effectively aligns multimodal features in a coarse-to-fine strategy, mitigating interference from warped modalities.
    • DFA estimates deformation fields for improved spatial alignment, while CFA aggregates complementary contextual information across scales.
    • MMFE enhances semantic alignment and inter-class separability, producing object masks with sharp boundaries.
    • Experiments demonstrated superior performance of DML on both aligned and unaligned datasets compared to existing methods.

    Conclusions:

    • The proposed DML method offers a robust solution for semantic segmentation using unaligned RGB-T image pairs.
    • The new benchmark facilitates research on handling real-world, misaligned multimodal imaging data.
    • DML achieves state-of-the-art results, highlighting the importance of addressing modality alignment in RGB-T SS.