Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Variable Temporal Length Training for Action Recognition CNNs.

Sensors (Basel, Switzerland)·2024

Same author

Gender Recognition Based on Gradual and Ensemble Learning from Multi-View Gait Energy Images and Poses.

Sensors (Basel, Switzerland)·2023

Same author

Facial Micro-Expression Recognition Using Double-Stream 3D Convolutional Neural Network with Domain Adaptation.

Sensors (Basel, Switzerland)·2023

Same author

Consistent responses of coral microbiome to acute and chronic heat stress exposures.

Marine environmental research·2023

Same author

A novel estrogen-targeted PEGylated liposome co-delivery oxaliplatin and paclitaxel for the treatment of ovarian cancer.

Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie·2023

Same author

A single-cell atlas reveals the heterogeneity of meningeal immunity in a mouse model of Methyl CpG binding protein 2 deficiency.

Frontiers in immunology·2023

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 27, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Deep Learning-Based Monocular 3D Object Detection with Refinement of Depth Information.

Henan Hu^1,2,3, Ming Zhu¹, Muyu Li⁴

¹Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China.

Sensors (Basel, Switzerland)

|April 12, 2022

Summary

This summary is machine-generated.

This study enhances monocular 3D target detection using pseudo-LiDAR by improving depth estimation accuracy. The new method refines target positioning and reduces depth uncertainty, significantly boosting performance on the KITTI dataset.

Keywords:

3D object detection autonomous driving deep learning depth estimation monocular image point cloud

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Related Experiment Videos

Last Updated: Sep 27, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Area of Science:

Computer Vision
Robotics
Autonomous Systems

Background:

Monocular 3D target detection using pseudo-LiDAR shows promise but suffers from robustness issues.
Key limitations include inaccurate target positioning and depth distribution uncertainty, stemming from imprecise depth estimation.

Purpose of the Study:

To address the limitations of pseudo-LiDAR methods in monocular 3D target detection.
To improve the accuracy of target depth estimation and reduce uncertainty in depth distribution.

Main Methods:

A novel method combining image segmentation and geometric constraints for accurate target depth prediction and confidence measurement.
Utilizing normalized target scale as a priori information to mitigate depth distribution uncertainty (long-tail noise).
Converting refined depth maps into pseudo-LiDAR point clouds for input into LiDAR-based detection algorithms.

Main Results:

The proposed framework significantly outperforms state-of-the-art methods on the KITTI dataset.
Achieved improvements of over 12.37% (easy) and 5.34% (hard) on the KITTI validation subset.
Demonstrated superior performance on the KITTI test set with gains of 5.1% (easy) and 1.76% (hard).

Conclusions:

The developed approach effectively enhances monocular 3D target detection by refining depth estimation.
The method successfully overcomes limitations in target positioning and depth uncertainty, leading to substantial performance gains.
The framework offers a robust and accurate solution for 3D object detection using pseudo-LiDAR data.