Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.0K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Variable Temporal Length Training for Action Recognition CNNs.

Sensors (Basel, Switzerland)·2024
Same author

Gender Recognition Based on Gradual and Ensemble Learning from Multi-View Gait Energy Images and Poses.

Sensors (Basel, Switzerland)·2023
Same author

Facial Micro-Expression Recognition Using Double-Stream 3D Convolutional Neural Network with Domain Adaptation.

Sensors (Basel, Switzerland)·2023
Same author

Consistent responses of coral microbiome to acute and chronic heat stress exposures.

Marine environmental research·2023
Same author

A novel estrogen-targeted PEGylated liposome co-delivery oxaliplatin and paclitaxel for the treatment of ovarian cancer.

Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie·2023
Same author

A single-cell atlas reveals the heterogeneity of meningeal immunity in a mouse model of Methyl CpG binding protein 2 deficiency.

Frontiers in immunology·2023
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Sep 27, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

662

Deep Learning-Based Monocular 3D Object Detection with Refinement of Depth Information.

Henan Hu1,2,3, Ming Zhu1, Muyu Li4

  • 1Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China.

Sensors (Basel, Switzerland)
|April 12, 2022
PubMed
Summary
This summary is machine-generated.

This study enhances monocular 3D target detection using pseudo-LiDAR by improving depth estimation accuracy. The new method refines target positioning and reduces depth uncertainty, significantly boosting performance on the KITTI dataset.

Keywords:
3D object detectionautonomous drivingdeep learningdepth estimationmonocular imagepoint cloud

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K

Related Experiment Videos

Last Updated: Sep 27, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

662
A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.0K

Area of Science:

  • Computer Vision
  • Robotics
  • Autonomous Systems

Background:

  • Monocular 3D target detection using pseudo-LiDAR shows promise but suffers from robustness issues.
  • Key limitations include inaccurate target positioning and depth distribution uncertainty, stemming from imprecise depth estimation.

Purpose of the Study:

  • To address the limitations of pseudo-LiDAR methods in monocular 3D target detection.
  • To improve the accuracy of target depth estimation and reduce uncertainty in depth distribution.

Main Methods:

  • A novel method combining image segmentation and geometric constraints for accurate target depth prediction and confidence measurement.
  • Utilizing normalized target scale as a priori information to mitigate depth distribution uncertainty (long-tail noise).
  • Converting refined depth maps into pseudo-LiDAR point clouds for input into LiDAR-based detection algorithms.

Main Results:

  • The proposed framework significantly outperforms state-of-the-art methods on the KITTI dataset.
  • Achieved improvements of over 12.37% (easy) and 5.34% (hard) on the KITTI validation subset.
  • Demonstrated superior performance on the KITTI test set with gains of 5.1% (easy) and 1.76% (hard).

Conclusions:

  • The developed approach effectively enhances monocular 3D target detection by refining depth estimation.
  • The method successfully overcomes limitations in target positioning and depth uncertainty, leading to substantial performance gains.
  • The framework offers a robust and accurate solution for 3D object detection using pseudo-LiDAR data.