Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Deconvolution

Deconvolution

Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Polybenzimidazole-based magnetic solid-phase extraction coupled with gas chromatography-mass spectrometry for the determination of triazine herbicides in environmental waters.

Journal of chromatography. A·2026

Same author

Repurposing the Antibiotic Tigecycline to Inhibit Tumor Growth and Hormone Secretion in Somatotroph Pituitary Neuroendocrine Tumors.

International journal of endocrinology·2026

Same author

Enhancing Underwater Light Field Images via Global Geometry-Aware Diffusion Process.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same author

Regulating B-configuration in N-doped carbon to enhance H<sub>2</sub>O<sub>2</sub> electrosynthesis and Fe<sup>3+</sup>/Fe<sup>2+</sup> cycling for electro-Fenton water purification.

Journal of environmental management·2026

Same author

Cholangioscopy-guided diagnosis and management of biliary cast syndrome in a nontransplant patient.

VideoGIE : an official video journal of the American Society for Gastrointestinal Endoscopy·2026

Same author

SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation.

IEEE transactions on visualization and computer graphics·2026

Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 17, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection.

Yifan Zhang, Zhiyu Zhu, Junhui Hou

IEEE Transactions on Pattern Analysis and Machine Intelligence

|August 14, 2024

Summary

This summary is machine-generated.

This study introduces STEMD, a novel framework for multi-frame 3D object detection. It enhances Detection Transformer (DETR) models by improving spatial-temporal modeling and reducing redundant predictions for better performance.

More Related Videos

High-resolution, High-speed, Three-dimensional Video Imaging with Digital Fringe Projection Techniques

High-resolution, High-speed, Three-dimensional Video Imaging with Digital Fringe Projection Techniques

Published on: December 3, 2013

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

Related Experiment Videos

Last Updated: Jun 17, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

High-resolution, High-speed, Three-dimensional Video Imaging with Digital Fringe Projection Techniques

High-resolution, High-speed, Three-dimensional Video Imaging with Digital Fringe Projection Techniques

Published on: December 3, 2013

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Detection Transformer (DETR) models have advanced CNN-based object detection.
The application of DETR-like paradigms to multi-frame 3D object detection is underexplored.

Purpose of the Study:

To present STEMD, an end-to-end framework enhancing DETR for multi-frame 3D object detection.
To address challenges in spatial-temporal modeling and redundant predictions.

Main Methods:

Introduced a spatial-temporal graph attention network for inter-object interactions and temporal dependencies.
Incorporated previous frame outputs to initialize decoder queries, mitigating missing hard cases.
Utilized an IoU regularization term to suppress similar, non-positive queries and reduce redundant boxes.

Main Results:

Demonstrated effectiveness in challenging multi-frame 3D object detection scenarios.
Achieved improved performance with only a minor increase in computational overhead.
Successfully modeled complex object interactions and temporal dynamics.

Conclusions:

STEMD offers a robust enhancement to DETR-like architectures for multi-frame 3D object detection.
The proposed methods effectively handle spatial-temporal complexities and improve prediction accuracy.
STEMD presents a computationally efficient solution for advanced 3D object detection tasks.