Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

776
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
776
Deconvolution01:20

Deconvolution

212
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
212
Downsampling01:20

Downsampling

209
When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...
209
Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

6.7K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
6.7K
Upsampling01:22

Upsampling

275
Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
275
Newman Projections02:06

Newman Projections

17.1K
Different notations are used to represent the three-dimensional structure of molecules on two-dimensional surfaces. One of the most commonly used representations is the dash-wedge formula. The dashed wedges, solid wedges, and the plane lines indicate the groups situated behind the plane, coming out of the plane, and in the plane, respectively.
The organic molecules rotate across the single bonds leading to numerous temporary three-dimensional structures of varying energy known as...
17.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Early-life temperature drives recruitment success in Eurasian perch (Perca fluviatilis) populations.

Journal of fish biology·2026
Same author

Automated Multi-Modal MRI Segmentation of Stroke Lesions and Corticospinal Tract Integrity for Functional Outcome Prediction.

Tomography (Ann Arbor, Mich.)·2026
Same author

Radiogenomics for Glioblastoma Survival Prediction: Integrating Radiomics, Clinical, and Genomic Features Using Artificial Intelligence.

Journal of imaging informatics in medicine·2025
Same author

Advancing breast cancer relapse prediction with radiomics and neural networks: a clinically interpretable framework.

Frontiers in oncology·2025
Same author

M-TabNet: A Transformer-Based Multi-Encoder for Early Neonatal Birth Weight Prediction Using Multimodal Data.

IEEE journal of biomedical and health informatics·2025
Same author

DeepISLES: a clinically validated ischemic stroke segmentation model from the ISLES'22 challenge.

Nature communications·2025
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Aug 5, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

592

Deep Monocular Depth Estimation Based on Content and Contextual Features.

Saddam Abdulwahab1, Hatem A Rashwan1, Najwa Sharaf1

  • 1Department of Computer Engineering and Mathematics, Universitat Rovira i Virgil, Campus Sescelades, Avinguda dels Paisos Catalans, 26, 43007 Tarragona, Spain.

Sensors (Basel, Switzerland)
|March 30, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a new deep learning method for accurate monocular depth estimation. By using semantic information, it improves depth prediction, especially in challenging areas like low-texture regions and occlusions.

Keywords:
autoencoder networkcontextual semantic informationdeep learningmonocular depth estimation

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.8K
Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
07:45

Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

Published on: July 21, 2020

4.5K

Related Experiment Videos

Last Updated: Aug 5, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

592
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.8K
Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
07:45

Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

Published on: July 21, 2020

4.5K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Deep learning methods for monocular depth estimation often struggle with low-texture areas and occlusions.
  • Existing approaches primarily rely on RGB image content and structure, limiting accuracy.

Purpose of the Study:

  • To propose a novel method for precise monocular depth map estimation using contextual semantic information.
  • To enhance the accuracy and robustness of depth estimation by preserving depth discontinuities and object boundaries.

Main Methods:

  • Leveraging a deep autoencoder network integrated with high-quality semantic features from HRNet-v2 semantic segmentation.
  • Utilizing object localization and boundary information from semantic features to guide depth prediction.

Main Results:

  • Achieved 85% accuracy on NYU Depth v2 and SUN RGB-D datasets.
  • Outperformed state-of-the-art methods, reducing error metrics (Rel by 0.12, RMS by 0.523, log10 by 0.0527).
  • Demonstrated superior performance in preserving object boundaries and detecting small structures.

Conclusions:

  • The proposed semantic-feature-enhanced method significantly improves monocular depth estimation accuracy and robustness.
  • Exploiting contextual semantic information is effective for overcoming limitations of textureless regions and occlusions in depth prediction.