Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Gestalt Principles of Perception

Gestalt Principles of Perception

Gestalt principles provide a framework for understanding how humans perceive objects as unified wholes within their context. These principles are essential in explaining the cognitive processes that make sense of complex visual stimuli by organizing them into coherent groups. One fundamental principle is proximity, which posits that objects located close to each other are perceived as a collective group. For instance, when dots are positioned near one another, the visual system interprets them...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Genomic characterization of a large-scale chikungunya outbreak in China.

The Journal of infection·2026

Same author

Communication Delay-Based Under-Actuated MASVs Distributed Formation Tracking Control With Unknown Ocean Disturbances and Input Quantization.

IEEE transactions on cybernetics·2026

Same author

<i>Acinetobacter pengchengensis</i> sp. nov., isolated from the urban wastewater of Shenzhen, Guangdong Province, China.

International journal of systematic and evolutionary microbiology·2026

Same author

Contextual Style Coherence Network for X-Ray Prohibited Item Image Synthesis.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same author

Vacuum-stabilized lung window enables real-time and simultaneous imaging of vascular and calcium responses to hypoxia in vivo.

Respiratory research·2026

Same author

Global and Local Visual-Textual Alignment for Open Vocabulary Object Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Explainability Enhanced Object Detection Transformer With Feature Disentanglement.

Wenlong Yu, Ruonan Liu, Dongyue Chen

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

|November 12, 2024

Summary

This summary is machine-generated.

We developed a novel disentanglement method for deep learning object detection models to improve explainability. This approach enhances feature learning and interpretability in critical applications.

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Related Experiment Videos

Last Updated: Jun 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Area of Science:

Computer Vision
Deep Learning
Artificial Intelligence

Background:

Explainability is crucial for deploying deep learning models in critical applications.
Existing object detection models, particularly DETR variants, suffer from entangled features, hindering interpretability.
The regression function in object detection contributes to feature entanglement and reduced semantic coverage.

Purpose of the Study:

To enhance the explainability of end-to-end object detection with Transformer (DETR) models.
To introduce a feature disentanglement method to improve model interpretability and performance.
To address the limitations of entangled features in deep learning-based object detection.

Main Methods:

A divide-and-conquer decoupling paradigm was employed for feature learning.
Tensor Singular Value Decomposition (T-SVD) was utilized to generate feature bases.
Batch averaged Feature Spectral Penalization (BFSP) loss was introduced to constrain feature disentanglement and balance semantic activation.

Main Results:

The proposed Explainability Enhanced object detection Transformer with feature Disentanglement (DETD) model demonstrated improved object detection performance.
Consistent outperformance was observed across various backbones and DETR variants on two datasets.
Grad-CAM visualizations confirmed enhanced feature learning explainability through feature disentanglement.

Conclusions:

The DETD model effectively disentangles features, leading to enhanced explainability in object detection.
The proposed method improves both detection performance and feature interpretability.
This work offers a pathway for more trustworthy and interpretable deep learning models in critical domains.