Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

473
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
473

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Turning Motion into Methane: Electromagnetic Induction in Microbial Aggregates Enhances Wastewater Resource Recovery.

Environmental science & technology·2026
Same author

Reply.

Journal of vascular surgery. Venous and lymphatic disorders·2026
Same author

Rapid evolution driven by translocation-associated selection during meiosis.

EMBO reports·2026
Same author

High-pressure synthesis and high-performance half metallicity of quadruple perovskite oxide DyCu<sub>3</sub>Fe<sub>2</sub>Re<sub>2</sub>O<sub>12</sub>.

Fundamental research·2026
Same author

Structure-Based Design and Optimization of Novel PHGDH Inhibitors for Overcoming Erlotinib-Resistant Lung Cancer.

Journal of medicinal chemistry·2026
Same author

Metabolic dysregulation and antibody-mediated rejection after kidney transplantation: interacting mechanisms and emerging clinical strategies.

Frontiers in immunology·2026
Same journal

Correction: A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms.

Scientific reports·2026
Same journal

Poly(bromophenol blue)/CoSn(OH)<sub>6</sub> cubic particles modified pencil graphite electrode for electrochemical determination of diphenhydramine.

Scientific reports·2026
Same journal

Dietary Chlorella, Spirulina, and acidifier modulate jejunal cytokine-related gene expression in broiler chickens.

Scientific reports·2026
Same journal

Perceived physical activity barriers in university students: associations with fatigue and eating behaviours.

Scientific reports·2026
Same journal

Refuge limitation structures habitat use in agricultural landscapes: evidence from Sunda pangolins.

Scientific reports·2026
Same journal

Lightweight stateless transaction verification with outsourced witness updates for UTXO blockchains.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: May 13, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

434

A simple monocular depth estimation network for balancing complexity and accuracy.

Xuanxuan Liu1, Shuai Tang2, Mengdie Feng1

  • 1Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, 518000, Guangdong, Shenzhen, China.

Scientific Reports
|April 14, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces SimMDE, a novel model for monocular depth estimation. It achieves high accuracy and computational efficiency by treating depth estimation as ordinal regression with sparse attention mechanisms.

Keywords:
Adaptive binsDeformable cross-attentionMonocular depth estimationTransformer

More Related Videos

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders
05:49

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

Published on: November 1, 2024

611
Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery
05:12

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

1.9K

Related Experiment Videos

Last Updated: May 13, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

434
Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders
05:49

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

Published on: November 1, 2024

611
Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery
05:12

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

1.9K

Area of Science:

  • Computer Vision
  • Machine Learning

Background:

  • Monocular depth estimation is vital for visual tasks but often requires high computational cost.
  • Existing methods struggle to balance accuracy with efficiency in practical applications.

Purpose of the Study:

  • To develop a computationally efficient and accurate monocular depth estimation model.
  • To address the trade-off between performance and resource consumption in depth prediction.

Main Methods:

  • Proposing SimMDE, a novel model treating monocular depth estimation as ordinal regression.
  • Utilizing a Deformable Cross-Attention Feature Fusion (DCF) decoder with sparse attention to reduce Transformer complexity.
  • Introducing Local Multi-dimensional Convolutional Attention (LMC) and Wavelet Attention Transformer (WAT) modules for enhanced feature extraction and pixel-level classification.

Main Results:

  • SimMDE achieves exceptional accuracy on NYU and KITTI benchmark datasets.
  • The model demonstrates significant improvements in absolute relative error (AbsRel) by 11.7% (NYU) and 10.3% (KITTI).
  • SimMDE achieves these results with fewer parameters and high computational efficiency.

Conclusions:

  • SimMDE offers a promising solution for accurate and efficient monocular depth estimation.
  • The proposed model effectively integrates multi-scale features and enhances local feature extraction.
  • SimMDE represents a significant advancement over existing methods, particularly for real-world applications.