Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

843
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
843
Convolution Properties II01:17

Convolution Properties II

267
The important convolution properties include width, area, differentiation, and integration properties.
The width property indicates that if the durations of input signals are T1 and T2, then the width of the output response equals the sum of both durations, irrespective of the shapes of the two functions. For instance, convolving two rectangular pulses with durations of 2 seconds and 1 second results in a function with a width of 3 seconds.
The area property asserts that the area under the...
267
Deconvolution01:20

Deconvolution

236
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
236
Convolution Properties I01:20

Convolution Properties I

220
Convolution computations can be simplified by utilizing their inherent properties.
The commutative property reveals that the input and the impulse response of an LTI (Linear Time-Invariant) system can be interchanged without affecting the output:
220
Detection of Black Holes01:10

Detection of Black Holes

2.3K
Although black holes were theoretically postulated in the 1920s, they remained outside the domain of observational astronomy until the 1970s.
Their closest cousins are neutron stars, which are composed almost entirely of neutrons packed against each other, making them extremely dense. A neutron star has the same mass as the Sun but its diameter is only a few kilometers. Therefore, the escape velocity from their surface is close to the speed of light.
Not until the 1960s, when the first neutron...
2.3K
Convolution: Math, Graphics, and Discrete Signals01:24

Convolution: Math, Graphics, and Discrete Signals

352
In any LTI (Linear Time-Invariant) system, the convolution of two signals is denoted using a convolution operator, assuming all initial conditions are zero. The convolution integral can be divided into two parts: the zero-input or natural response and the zero-state or forced response, with t0 indicating the initial time.
To simplify the convolution integral, it is assumed that both the input signal and impulse response are zero for negative time values. The graphical convolution process...
352

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Identifying Negative Biomarkers of CD4 Cytotoxic T Lymphocytes via Granzyme B Exclusion Analysis.

European journal of immunology·2026
Same author

Mitochondrial-impaired ductal epithelium fuels TLS formation in Sjögren's disease.

Annals of the rheumatic diseases·2026
Same author

Exosomes from joints mediate depressive-like behaviors in MIA arthritis mice.

Brain research bulletin·2026
Same author

A multi-region flexible neural interface for behavioral state decoding in freely moving mice.

Microsystems & nanoengineering·2026
Same author

Neurobehavioral toxicity of paroxetine in Gambusia affinis: Dissociated behavioral syndromes and impaired monoaminergic neurotransmission.

Environmental pollution (Barking, Essex : 1987)·2026
Same author

Development and validation of early-stage and progression prediction models for chronic kidney disease: a retrospective study.

PeerJ·2026
Same journal

Analysis of strength degradation of coal and rock masses and stability of mined areas under long term immersion environment.

PloS one·2026
Same journal

Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus.

PloS one·2026
Same journal

Preparation and physicochemical characterization of a biodegradable chitosan/carboxymethyl cellulose hydrogel synthesized in NaOH/urea medium.

PloS one·2026
Same journal

Action-guilt, survivor-guilt, and depression in combat-related PTSD.

PloS one·2026
Same journal

Explainable machine learning for predicting activities of daily living at discharge in stroke patients: A retrospective study using SHAP interpretability.

PloS one·2026
Same journal

Deep learning based two-way feature depiction model for brain tumor detection.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Aug 26, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

612

MonoDCN: Monocular 3D object detection based on dynamic convolution.

Shenming Qu1, Xinyu Yang1, Yiming Gao1

  • 1School of Software, Henan University, Kaifeng, Henan, China.

Plos One
|October 4, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces dynamic convolution guided by depth maps for monocular 3D object detection. This method effectively integrates depth and semantic information, improving autonomous driving perception accuracy.

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.9K
Correlative Microscopy for 3D Structural Analysis of Dynamic Interactions
13:43

Correlative Microscopy for 3D Structural Analysis of Dynamic Interactions

Published on: June 24, 2013

14.2K

Related Experiment Videos

Last Updated: Aug 26, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

612
A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.9K
Correlative Microscopy for 3D Structural Analysis of Dynamic Interactions
13:43

Correlative Microscopy for 3D Structural Analysis of Dynamic Interactions

Published on: June 24, 2013

14.2K

Area of Science:

  • Computer Vision
  • Autonomous Driving Systems
  • Machine Learning

Background:

  • Monocular 3D object detection is crucial for autonomous driving perception.
  • Current methods using RGB images or pseudo-radar point clouds have limitations, including complexity, inefficiency, noise, and ignoring semantic information.
  • Existing image-based methods often require high-precision depth estimation or struggle to use depth and semantic data simultaneously.

Purpose of the Study:

  • To develop a novel approach for monocular 3D object detection that effectively utilizes both depth and semantic information.
  • To improve the accuracy and efficiency of 3D object detection in autonomous driving scenarios.
  • To address the limitations of existing methods in handling depth information and semantic context.

Main Methods:

  • Introduction of dynamic convolution guided by depth maps into the feature extraction network.
  • The convolution kernel dynamically learns from the image's depth map.
  • Integration of depth map information directly within the feature extraction process.

Main Results:

  • Successfully enabled simultaneous use of depth and semantic information.
  • Significantly improved the accuracy of monocular 3D object detection.
  • Demonstrated enhanced performance on both monocular 3D object detection and Bird's Eye View tasks on the KITTI dataset.

Conclusions:

  • The proposed dynamic convolution method offers a more effective way to leverage depth information in monocular 3D object detection.
  • This approach overcomes previous limitations, leading to more robust and accurate environmental perception for autonomous vehicles.
  • The method shows significant potential for advancing autonomous driving technology.