Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Lipid Metabolic Effects Induced by Individual and Combined Exposure to Multiple Food Additives in Human Cells.

Toxics·2026

Same author

From Single Atom to Five-Atom Cluster Catalysts on Boron-Doped Diamond: Interface Engineering and Dynamic Active Sites Exploration for Acidic OER.

The journal of physical chemistry letters·2026

Same author

Research progress of high-entropy catalysts in electrochemical oxidation of organic small molecules.

Chemical communications (Cambridge, England)·2026

Same author

Design and evolution of the tetracycline repressor into sulfonylurea herbicide-responsive gene switches for field crops.

Nature communications·2026

Same author

An integrated microfluidic chip for the capture, migration and molecular phenotyping of single circulating tumor cells.

Biosensors & bioelectronics·2026

Same author

d-Orbital modulation of high-entropy sulfides with amorphous/crystalline heterostructures for simultaneous hydrogen production and sulfur recovery.

Chemical science·2026

Same journal

Style-Aware Contrastive Test-Time Adaptation: A Dual-Cache Model for Robust Vision-Language Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Semantic Frame Interpolation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Physics-Guided Cross-Modal Decoupling with Test-Time Adaptation for Hyperspectral Image Restoration.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 8, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation.

Xibin Song, Wei Li, Dingfu Zhou

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

|April 26, 2021

Summary

This summary is machine-generated.

This study introduces MLDA-Net, a novel framework for self-supervised monocular depth estimation. It overcomes blurriness in existing methods, producing sharper depth maps with richer details for improved accuracy.

More Related Videos

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

Related Experiment Videos

Last Updated: Nov 8, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

Area of Science:

Computer Vision
Machine Learning
Deep Learning

Background:

Supervised learning for depth estimation requires extensive, costly annotations.
Self-supervised methods are desirable but often produce blurry depth maps with lost details.

Purpose of the Study:

To develop a novel framework, MLDA-Net, for self-supervised monocular depth estimation.
To generate per-pixel depth maps with sharper boundaries and richer details.

Main Methods:

Implemented a multi-level feature extraction (MLFE) strategy for hierarchical representation learning.
Introduced a dual-attention strategy (global and structure attention) to enhance features.
Utilized a reweighted loss strategy based on multi-level outputs for effective supervision.

Main Results:

MLDA-Net achieves state-of-the-art results on the KITTI benchmark for self-supervised monocular depth estimation.
Demonstrated superior performance across different input and training modes.
Validated effectiveness on additional benchmark datasets.

Conclusions:

MLDA-Net effectively addresses limitations of existing self-supervised depth estimation methods.
The proposed framework produces significantly improved depth maps with enhanced detail and sharpness.
MLDA-Net represents a significant advancement in self-supervised monocular depth estimation.