Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

643
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
643

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Classification of 24-h movement behaviour patterns among university students and their relationship with physical fitness: a latent profile analysis.

BMC public health·2026
Same author

SAMS-Net: A Smoothness-Anchored Monotone Neural Differential Equation Network for Failure-Only-Supervised Structural Health Indicator Construction.

Sensors (Basel, Switzerland)·2026
Same author

Ligand-mediated suppression of Ostwald ripening enables low-temperature sol-gel ZnO for efficient inverted flexible organic photovoltaics.

Nature communications·2026
Same author

Associations of 24-h movement behavior with mental health in adolescent athletes: a compositional isotemporal substitution analysis.

Frontiers in public health·2026
Same author

Sirtuin 2 Regulates Dorsal Hippocampal Actin Polymerization and Microtubule Acetylation-dependent EB3 Activation to Modulate Opioid Withdrawal-Induced Aversive Memory and Associative Learning.

Biological psychiatry·2026
Same author

The HALP-CONUT integrated score predicts survival and postoperative complications in locally advanced esophageal squamous cell carcinoma following neoadjuvant therapy: evidence from a multicenter retrospective cohort study.

World journal of surgical oncology·2026
Same journal

Interpretable Model for Clinical Use in Left Atrial Appendage Segmentation via an Optimised Deformable-Attention U-Net With Spatial-Channel Fusion.

Healthcare technology letters·2026
Same journal

Driving Innovation: Transatlantic Attitudes to the <i>Bionics Bus</i> as a Vehicle for Health Transformation and STEM Engagement.

Healthcare technology letters·2026
Same journal

Gamified Digital Solutions for Tinnitus Health Literacy: The Erasmus+ Project TinWise.

Healthcare technology letters·2026
Same journal

Effect of Technology-Supported Measures Used for Care Transition Decisions for Chronic Disease Patients: A Systematic Review and Meta-Analysis.

Healthcare technology letters·2026
Same journal

Bibliometric Trends in the Integration of Computer Vision With Healthcare.

Healthcare technology letters·2026
Same journal

Parameter-Efficient Deep Learning Models for Vital Sign Estimation From PPG.

Healthcare technology letters·2026
See all related articles

Related Experiment Video

Updated: Jun 28, 2025

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery
05:12

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

2.0K

Generalizable stereo depth estimation with masked image modelling.

Samyakh Tukra1,2, Haozheng Xu1, Chi Xu1

  • 1Hamlyn Centre of Robotic Surgery, Department of Surgery and Cancer Imperial College London London UK.

Healthcare Technology Letters
|April 19, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a novel two-phase training for stereo depth estimation, enhancing accuracy in 3D reconstruction. The method achieves state-of-the-art results without needing surgical data for training.

Keywords:
computer visionconvolutional neural netslearning (artificial intelligence)neural netsstereo image processing

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.7K
Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues
08:04

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Published on: December 4, 2013

4.5K

Related Experiment Videos

Last Updated: Jun 28, 2025

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery
05:12

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

2.0K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.7K
Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues
08:04

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Published on: December 4, 2013

4.5K

Area of Science:

  • Computer Vision
  • Medical Imaging
  • 3D Reconstruction

Background:

  • Accurate stereo depth estimation is crucial for 3D reconstruction, particularly in surgical applications.
  • Supervised methods excel but struggle with limited surgical ground truth data, hindering generalizability.
  • Self-supervised methods lack ground truth but face scale ambiguity and photometric inconsistencies.

Purpose of the Study:

  • To develop a generalizable and high-performance stereo depth estimation method for 3D reconstruction.
  • To overcome limitations of existing supervised and self-supervised approaches in surgical and natural scenes.
  • To achieve state-of-the-art accuracy without direct training on target scene data.

Main Methods:

  • A two-phase training procedure combining self-supervised masked image modeling (MIM) and supervised learning.
  • Phase 1: Self-supervised representation learning using MIM to acquire generalizable semantic stereo features.
  • Phase 2: Supervised learning on synthetic data using the MIM pre-trained model, incorporating perceptual losses to enhance stereo representations.

Main Results:

  • The proposed method achieves sub-millimetre accuracy on surgical scenes and lowest errors on natural scenes.
  • Demonstrates state-of-the-art performance in stereo depth estimation.
  • Qualitative and quantitative evaluations confirm the approach's effectiveness and generalizability.

Conclusions:

  • The two-phase training strategy effectively bridges the gap between self-supervised and supervised learning for stereo depth estimation.
  • The method achieves high accuracy and generalizability without requiring direct training on specific scene data like surgical or natural images.
  • This approach sets a new benchmark for stereo depth estimation in demanding applications such as robotic surgery.