Jove
Visualize
Contact Us
  1. Home
  2. Research Domains
  3. Information And Computing Sciences
  4. Computer Vision And Multimedia Computation
  5. Video Processing
  6. Uav-detr: An Enhanced Rt-detr Architecture For Efficient Small Object Detection In Uav Imagery

UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery

Yu Zhou1, Yan Wei1

  • 1College of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China.

Sensors (Basel, Switzerland)
|August 14, 2025

Related Experiment Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

635
Electroantennography-based Bio-hybrid Odor-detecting Drone using Silkmoth Antennae for Odor Source Localization
06:00

Electroantennography-based Bio-hybrid Odor-detecting Drone using Silkmoth Antennae for Odor Source Localization

Published on: August 27, 2021

5.4K
Visually Mediated Odor Tracking During Flight in Drosophila
08:50

Visually Mediated Odor Tracking During Flight in Drosophila

Published on: January 26, 2009

10.0K

View abstract on PubMed

Summary
This summary is machine-generated.

UAV-DETR enhances object detection in aerial imagery by improving feature perception and spatial alignment, achieving superior small-object detection performance with fewer parameters.

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Aerial imagery from unmanned aerial vehicles (UAVs) presents unique challenges for object detection, including feature degradation and spatial-contextual misalignment.
  • Low resolution, complex backgrounds, and dynamic shooting conditions in UAV data hinder accurate detection of small objects.

Purpose of the Study:

  • To propose UAV-DETR, an enhanced Transformer-based object detection model specifically designed to address the challenges of aerial imagery.
  • To improve feature perception, semantic representation, and spatial alignment for more robust small-object detection in UAV-acquired data.

Main Methods:

  • UAV-DETR extends the RT-DETR framework with three novel modules: Channel-Aware Sensing (CAS), Scale-Optimized Enhancement Pyramid (SOEP), and Context-Spatial Alignment (CSAM).
  • CAS refines the backbone for better multi-scale feature perception.
  • SOEP enhances shallow layer semantic richness via channel-weighted fusion.
  • CSAM optimizes the hybrid encoder for contextual and spatial calibration, improving cross-scale integration.

Main Results:

  • UAV-DETR demonstrates superior small-object detection performance on the VisDrone2019 dataset, achieving an mAP@0.5 of 51.6%.
  • The model outperforms the baseline RT-DETR by 3.5% in mAP@0.5.
  • UAV-DETR reduces model parameters from 19.8 million to 16.8 million compared to RT-DETR, indicating improved efficiency.

Conclusions:

  • The proposed UAV-DETR effectively mitigates technical challenges in UAV-based object detection, particularly for small objects.
  • The novel modules (CAS, SOEP, CSAM) significantly enhance feature perception, semantic representation, and spatial-contextual alignment.
  • UAV-DETR offers a promising solution for accurate and efficient object detection in complex aerial scenarios.
Keywords:
RT-DETRUAV imageryfeature fusionsmall object detection

Related Experiment Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

635
Electroantennography-based Bio-hybrid Odor-detecting Drone using Silkmoth Antennae for Odor Source Localization
06:00

Electroantennography-based Bio-hybrid Odor-detecting Drone using Silkmoth Antennae for Odor Source Localization

Published on: August 27, 2021

5.4K
Visually Mediated Odor Tracking During Flight in Drosophila
08:50

Visually Mediated Odor Tracking During Flight in Drosophila

Published on: January 26, 2009

10.0K

Related Concept Videos

Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

7.1K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
7.1K
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies