

Updated: Aug 5, 2025


Deep Monocular Depth Estimation Based on Content and Contextual Features.

Saddam Abdulwahab¹, Hatem A Rashwan¹, Najwa Sharaf¹

¹Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, Campus Sescelades, Avinguda dels Països Catalans, 26, 43007 Tarragona, Spain.

Sensors (Basel, Switzerland)

March 30, 2023

Summary

This summary is machine-generated.

This study introduces a new deep learning method for accurate monocular depth estimation. By using semantic information, it improves depth prediction, especially in challenging areas like low-texture regions and occlusions.

Keywords:

autoencoder network; contextual semantic information; deep learning; monocular depth estimation



Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Deep learning methods for monocular depth estimation often struggle with low-texture areas and occlusions.
Existing approaches primarily rely on RGB image content and structure, limiting accuracy.

Purpose of the Study:

To propose a novel method for precise monocular depth map estimation using contextual semantic information.
To enhance the accuracy and robustness of depth estimation by preserving depth discontinuities and object boundaries.

Main Methods:

Leveraging a deep autoencoder network integrated with high-quality semantic features from HRNet-v2 semantic segmentation.
Utilizing object localization and boundary information from semantic features to guide depth prediction.

Main Results:

Achieved 85% accuracy on NYU Depth v2 and SUN RGB-D datasets.
Outperformed state-of-the-art methods, reducing error metrics (Rel by 0.12, RMS by 0.523, log10 by 0.0527).
Demonstrated superior performance in preserving object boundaries and detecting small structures.
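
The summary names the error metrics Rel, RMS, and log10 but does not define them. The sketch below assumes the definitions commonly used in monocular depth estimation benchmarks: mean absolute relative error, root mean squared error, mean absolute log10 error, and a threshold accuracy (the fraction of pixels whose ratio max(d/d*, d*/d) is below 1.25). Whether the reported 85% accuracy is this threshold accuracy is an assumption, not something the summary states. The function name `depth_metrics`, the rule for skipping invalid pixels, and the sample values are hypothetical and chosen only for illustration.

```python
import math


def depth_metrics(pred, gt, threshold=1.25):
    """Compute common depth-estimation metrics over valid pixels.

    pred, gt: flat sequences of predicted and ground-truth depths in the
    same units. Pixels where either value is non-positive are skipped
    (an assumption; datasets differ in how they mark missing depth).
    """
    pairs = [(p, g) for p, g in zip(pred, gt) if p > 0 and g > 0]
    if not pairs:
        raise ValueError("no valid pixels to evaluate")
    n = len(pairs)

    # Rel: mean absolute error relative to the ground-truth depth.
    rel = sum(abs(p - g) / g for p, g in pairs) / n
    # RMS: root of the mean squared depth error.
    rms = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    # log10: mean absolute difference of base-10 log depths.
    log10 = sum(abs(math.log10(p) - math.log10(g)) for p, g in pairs) / n
    # Threshold accuracy: share of pixels with max(p/g, g/p) < threshold.
    delta = sum(1 for p, g in pairs if max(p / g, g / p) < threshold) / n

    return {"rel": rel, "rms": rms, "log10": log10, "delta": delta}


# Illustrative values only; the last pixel has no ground truth and is skipped.
pred = [1.0, 2.2, 3.0, 8.0]
gt = [1.0, 2.0, 4.0, 0.0]
metrics = depth_metrics(pred, gt)
print(metrics)
```

Lower Rel, RMS, and log10 values indicate better predictions, while a higher threshold accuracy is better, which is why the results above pair an accuracy figure with error figures.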

Conclusions:

The proposed semantic-feature-enhanced method significantly improves monocular depth estimation accuracy and robustness.
Exploiting contextual semantic information is effective for overcoming limitations of textureless regions and occlusions in depth prediction.