Updated: Aug 26, 2025

MonoDCN: Monocular 3D object detection based on dynamic convolution.

Shenming Qu¹, Xinyu Yang¹, Yiming Gao¹

¹School of Software, Henan University, Kaifeng, Henan, China.

October 4, 2022

Summary

This summary is machine-generated.

This study introduces dynamic convolution guided by depth maps for monocular 3D object detection. This method effectively integrates depth and semantic information, improving autonomous driving perception accuracy.

Area of Science:

Computer Vision
Autonomous Driving Systems
Machine Learning

Background:

Monocular 3D object detection is crucial for autonomous driving perception.
Current methods that take RGB images or pseudo-radar (pseudo-LiDAR) point clouds as input have limitations, including complexity, inefficiency, noise, and neglect of semantic information.
Existing image-based methods often require high-precision depth estimation or struggle to use depth and semantic data simultaneously.

Purpose of the Study:

To develop a novel approach for monocular 3D object detection that effectively utilizes both depth and semantic information.
To improve the accuracy and efficiency of 3D object detection in autonomous driving scenarios.
To address the limitations of existing methods in handling depth information and semantic context.

Main Methods:

Introduction of dynamic convolution guided by depth maps into the feature extraction network.
The convolution kernel dynamically learns from the image's depth map.
Integration of depth map information directly within the feature extraction process.
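
The idea of a depth-guided dynamic convolution can be illustrated with a small sketch. This is not the MonoDCN implementation: in the study the kernel is learned from the depth map, whereas here a fixed softmax over depth differences stands in for that learned kernel generator. The function name `depth_guided_dynamic_conv` and the `sharpness` parameter are hypothetical, chosen only for illustration, and the code uses plain Python lists so it runs without any dependencies.

```python
import math


def depth_guided_dynamic_conv(features, depth, sharpness=1.0, k=3):
    """Filter each pixel with its own k x k kernel derived from the depth map.

    features:  list of channels, each an H x W list of lists of floats.
    depth:     H x W list of lists of floats (e.g. estimated depth per pixel).
    sharpness: hypothetical scalar standing in for learned kernel parameters.
    """
    h, w = len(depth), len(depth[0])
    r = k // 2
    out = [[[0.0] * w for _ in range(h)] for _ in features]
    for y in range(h):
        for x in range(w):
            # Build this pixel's kernel from depth differences to its neighbours.
            coords, logits = [], []
            for dy in range(-r, r + 1):
                for dx in range(-r, r + 1):
                    ny = min(max(y + dy, 0), h - 1)  # replicate borders
                    nx = min(max(x + dx, 0), w - 1)
                    coords.append((ny, nx))
                    logits.append(-sharpness * abs(depth[ny][nx] - depth[y][x]))
            # Softmax so that every per-pixel kernel sums to 1.
            peak = max(logits)
            exps = [math.exp(v - peak) for v in logits]
            total = sum(exps)
            kernel = [e / total for e in exps]
            # Apply the same depth-derived kernel to every feature channel.
            for c, channel in enumerate(features):
                out[c][y][x] = sum(
                    wgt * channel[ny][nx] for wgt, (ny, nx) in zip(kernel, coords)
                )
    return out


if __name__ == "__main__":
    # Toy scene: a near surface (depth 5) next to a far surface (depth 50).
    depth = [[5.0, 5.0, 50.0, 50.0] for _ in range(3)]
    feats = [[[1.0, 1.0, 9.0, 9.0] for _ in range(3)]]
    print(depth_guided_dynamic_conv(feats, depth, sharpness=0.0)[0][1])
    print(depth_guided_dynamic_conv(feats, depth, sharpness=10.0)[0][1])
```

With `sharpness=0.0` every kernel is uniform, so the operation reduces to an ordinary box filter that blends features across the depth boundary. With a large `sharpness`, neighbours at a very different depth receive almost no weight, so features from the near and far surfaces stay separate. That is the sense in which depth information steers how semantic features are aggregated.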

Main Results:

Successfully enabled simultaneous use of depth and semantic information.
Significantly improved the accuracy of monocular 3D object detection.
Demonstrated enhanced performance on both monocular 3D object detection and Bird's Eye View tasks on the KITTI dataset.

Conclusions:

The proposed dynamic convolution method offers a more effective way to leverage depth information in monocular 3D object detection.
This approach overcomes previous limitations, leading to more robust and accurate environmental perception for autonomous vehicles.
The method shows significant potential for advancing autonomous driving technology.