Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.3K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Lipid Metabolic Effects Induced by Individual and Combined Exposure to Multiple Food Additives in Human Cells.

Toxics·2026
Same author

From Single Atom to Five-Atom Cluster Catalysts on Boron-Doped Diamond: Interface Engineering and Dynamic Active Sites Exploration for Acidic OER.

The journal of physical chemistry letters·2026
Same author

Research progress of high-entropy catalysts in electrochemical oxidation of organic small molecules.

Chemical communications (Cambridge, England)·2026
Same author

Design and evolution of the tetracycline repressor into sulfonylurea herbicide-responsive gene switches for field crops.

Nature communications·2026
Same author

An integrated microfluidic chip for the capture, migration and molecular phenotyping of single circulating tumor cells.

Biosensors & bioelectronics·2026
Same author

d-Orbital modulation of high-entropy sulfides with amorphous/crystalline heterostructures for simultaneous hydrogen production and sulfur recovery.

Chemical science·2026
Same journal

Style-Aware Contrastive Test-Time Adaptation: A Dual-Cache Model for Robust Vision-Language Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Semantic Frame Interpolation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Physics-Guided Cross-Modal Decoupling with Test-Time Adaptation for Hyperspectral Image Restoration.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Nov 8, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

744

MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation.

Xibin Song, Wei Li, Dingfu Zhou

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |April 26, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces MLDA-Net, a novel framework for self-supervised monocular depth estimation. It overcomes blurriness in existing methods, producing sharper depth maps with richer details for improved accuracy.

    More Related Videos

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
    12:39

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

    Published on: January 18, 2020

    7.9K
    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    4.7K

    Related Experiment Videos

    Last Updated: Nov 8, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    744
    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
    12:39

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

    Published on: January 18, 2020

    7.9K
    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    4.7K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Deep Learning

    Background:

    • Supervised learning for depth estimation requires extensive, costly annotations.
    • Self-supervised methods are desirable but often produce blurry depth maps with lost details.

    Purpose of the Study:

    • To develop a novel framework, MLDA-Net, for self-supervised monocular depth estimation.
    • To generate per-pixel depth maps with sharper boundaries and richer details.

    Main Methods:

    • Implemented a multi-level feature extraction (MLFE) strategy for hierarchical representation learning.
    • Introduced a dual-attention strategy (global and structure attention) to enhance features.
    • Utilized a reweighted loss strategy based on multi-level outputs for effective supervision.

    Main Results:

    • MLDA-Net achieves state-of-the-art results on the KITTI benchmark for self-supervised monocular depth estimation.
    • Demonstrated superior performance across different input and training modes.
    • Validated effectiveness on additional benchmark datasets.

    Conclusions:

    • MLDA-Net effectively addresses limitations of existing self-supervised depth estimation methods.
    • The proposed framework produces significantly improved depth maps with enhanced detail and sharpness.
    • MLDA-Net represents a significant advancement in self-supervised monocular depth estimation.