Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Classification of 24-h movement behaviour patterns among university students and their relationship with physical fitness: a latent profile analysis.

BMC public health·2026

Same author

SAMS-Net: A Smoothness-Anchored Monotone Neural Differential Equation Network for Failure-Only-Supervised Structural Health Indicator Construction.

Sensors (Basel, Switzerland)·2026

Same author

Ligand-mediated suppression of Ostwald ripening enables low-temperature sol-gel ZnO for efficient inverted flexible organic photovoltaics.

Nature communications·2026

Same author

Associations of 24-h movement behavior with mental health in adolescent athletes: a compositional isotemporal substitution analysis.

Frontiers in public health·2026

Same author

Sirtuin 2 Regulates Dorsal Hippocampal Actin Polymerization and Microtubule Acetylation-dependent EB3 Activation to Modulate Opioid Withdrawal-Induced Aversive Memory and Associative Learning.

Biological psychiatry·2026

Same author

The HALP-CONUT integrated score predicts survival and postoperative complications in locally advanced esophageal squamous cell carcinoma following neoadjuvant therapy: evidence from a multicenter retrospective cohort study.

World journal of surgical oncology·2026

Same journal

Interpretable Model for Clinical Use in Left Atrial Appendage Segmentation via an Optimised Deformable-Attention U-Net With Spatial-Channel Fusion.

Healthcare technology letters·2026

Same journal

Driving Innovation: Transatlantic Attitudes to the <i>Bionics Bus</i> as a Vehicle for Health Transformation and STEM Engagement.

Healthcare technology letters·2026

Same journal

Gamified Digital Solutions for Tinnitus Health Literacy: The Erasmus+ Project TinWise.

Healthcare technology letters·2026

Same journal

Effect of Technology-Supported Measures Used for Care Transition Decisions for Chronic Disease Patients: A Systematic Review and Meta-Analysis.

Healthcare technology letters·2026

Same journal

Bibliometric Trends in the Integration of Computer Vision With Healthcare.

Healthcare technology letters·2026

Same journal

Parameter-Efficient Deep Learning Models for Vital Sign Estimation From PPG.

Healthcare technology letters·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 28, 2025

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

Generalizable stereo depth estimation with masked image modelling.

Samyakh Tukra^1,2, Haozheng Xu¹, Chi Xu¹

¹Hamlyn Centre of Robotic Surgery, Department of Surgery and Cancer Imperial College London London UK.

Healthcare Technology Letters

|April 19, 2024

Summary

This summary is machine-generated.

This study introduces a novel two-phase training for stereo depth estimation, enhancing accuracy in 3D reconstruction. The method achieves state-of-the-art results without needing surgical data for training.

Keywords:

computer vision convolutional neural nets learning (artificial intelligence)neural nets stereo image processing

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Published on: December 4, 2013

Related Experiment Videos

Last Updated: Jun 28, 2025

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Published on: December 4, 2013

Area of Science:

Computer Vision
Medical Imaging
3D Reconstruction

Background:

Accurate stereo depth estimation is crucial for 3D reconstruction, particularly in surgical applications.
Supervised methods excel but struggle with limited surgical ground truth data, hindering generalizability.
Self-supervised methods lack ground truth but face scale ambiguity and photometric inconsistencies.

Purpose of the Study:

To develop a generalizable and high-performance stereo depth estimation method for 3D reconstruction.
To overcome limitations of existing supervised and self-supervised approaches in surgical and natural scenes.
To achieve state-of-the-art accuracy without direct training on target scene data.

Main Methods:

A two-phase training procedure combining self-supervised masked image modeling (MIM) and supervised learning.
Phase 1: Self-supervised representation learning using MIM to acquire generalizable semantic stereo features.
Phase 2: Supervised learning on synthetic data using the MIM pre-trained model, incorporating perceptual losses to enhance stereo representations.

Main Results:

The proposed method achieves sub-millimetre accuracy on surgical scenes and lowest errors on natural scenes.
Demonstrates state-of-the-art performance in stereo depth estimation.
Qualitative and quantitative evaluations confirm the approach's effectiveness and generalizability.

Conclusions:

The two-phase training strategy effectively bridges the gap between self-supervised and supervised learning for stereo depth estimation.
The method achieves high accuracy and generalizability without requiring direct training on specific scene data like surgical or natural images.
This approach sets a new benchmark for stereo depth estimation in demanding applications such as robotic surgery.