Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Gestalt Principles of Perception01:21

Gestalt Principles of Perception

279
Gestalt principles provide a framework for understanding how humans perceive objects as unified wholes within their context. These principles are essential in explaining the cognitive processes that make sense of complex visual stimuli by organizing them into coherent groups. One fundamental principle is proximity, which posits that objects located close to each other are perceived as a collective group. For instance, when dots are positioned near one another, the visual system interprets them...
279

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Genomic characterization of a large-scale chikungunya outbreak in China.

The Journal of infection·2026
Same author

Communication Delay-Based Under-Actuated MASVs Distributed Formation Tracking Control With Unknown Ocean Disturbances and Input Quantization.

IEEE transactions on cybernetics·2026
Same author

<i>Acinetobacter pengchengensis</i> sp. nov., isolated from the urban wastewater of Shenzhen, Guangdong Province, China.

International journal of systematic and evolutionary microbiology·2026
Same author

Contextual Style Coherence Network for X-Ray Prohibited Item Image Synthesis.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Vacuum-stabilized lung window enables real-time and simultaneous imaging of vascular and calcium responses to hypoxia in vivo.

Respiratory research·2026
Same author

Global and Local Visual-Textual Alignment for Open Vocabulary Object Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Jun 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.7K

Explainability Enhanced Object Detection Transformer With Feature Disentanglement.

Wenlong Yu, Ruonan Liu, Dongyue Chen

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |November 12, 2024
    PubMed
    Summary
    This summary is machine-generated.

    We developed a novel disentanglement method for deep learning object detection models to improve explainability. This approach enhances feature learning and interpretability in critical applications.

    More Related Videos

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    471
    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    369

    Related Experiment Videos

    Last Updated: Jun 7, 2025

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
    04:23

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

    Published on: April 21, 2023

    1.7K
    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    471
    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    369

    Area of Science:

    • Computer Vision
    • Deep Learning
    • Artificial Intelligence

    Background:

    • Explainability is crucial for deploying deep learning models in critical applications.
    • Existing object detection models, particularly DETR variants, suffer from entangled features, hindering interpretability.
    • The regression function in object detection contributes to feature entanglement and reduced semantic coverage.

    Purpose of the Study:

    • To enhance the explainability of end-to-end object detection with Transformer (DETR) models.
    • To introduce a feature disentanglement method to improve model interpretability and performance.
    • To address the limitations of entangled features in deep learning-based object detection.

    Main Methods:

    • A divide-and-conquer decoupling paradigm was employed for feature learning.
    • Tensor Singular Value Decomposition (T-SVD) was utilized to generate feature bases.
    • Batch averaged Feature Spectral Penalization (BFSP) loss was introduced to constrain feature disentanglement and balance semantic activation.

    Main Results:

    • The proposed Explainability Enhanced object detection Transformer with feature Disentanglement (DETD) model demonstrated improved object detection performance.
    • Consistent outperformance was observed across various backbones and DETR variants on two datasets.
    • Grad-CAM visualizations confirmed enhanced feature learning explainability through feature disentanglement.

    Conclusions:

    • The DETD model effectively disentangles features, leading to enhanced explainability in object detection.
    • The proposed method improves both detection performance and feature interpretability.
    • This work offers a pathway for more trustworthy and interpretable deep learning models in critical domains.