Learning Virtual View Selection for 3D Scene Semantic Segmentation.

Tai-Jiang Mu, Ming-Yuan Shen, Yu-Kun Lai

IEEE Transactions on Image Processing: A Publication of the IEEE Signal Processing Society

July 10, 2024

Summary

This summary is machine-generated.

This study introduces a new framework for 3D scene understanding by generating informative virtual 2D views. This approach enhances 3D semantic segmentation accuracy by overcoming limitations of real-world captured images.

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Joint 2D-3D learning is crucial for 3D vision tasks such as semantic segmentation, because 2D images and 3D geometry provide complementary information.
Current methods that rely only on real captured 2D images suffer from redundancy, occlusion, and limited fields of view, which hinders performance.
Effective 3D scene understanding requires overcoming the limitations of standard 2D image inputs.

Purpose of the Study:

To propose a general framework for joint 2D-3D scene understanding by selecting informative virtual 2D views.
To improve 3D semantic segmentation by integrating generated virtual views with 3D geometry data.
To enhance deep neural models for 3D vision tasks through a novel view selection strategy.

Main Methods:

Generating virtual 2D views based on an information score map derived from 3D scene semantic segmentation results.
Formalizing the information score map learning as a deep reinforcement learning process with rewards for accurate predictions.
Employing an efficient greedy virtual view coverage strategy in 6D space (coordinates and normals) for optimal surface coverage.
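
The greedy coverage step listed above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `greedy_view_coverage`, the fixed `radius`, and the rule that a candidate "covers" every surface point within that radius in 6D (coordinates plus normals) are all assumptions made for illustration. The information scores are taken as given here; in the paper they are learned.

```python
import numpy as np


def greedy_view_coverage(features, scores, candidate_idx, num_views, radius):
    """Greedily pick candidate views by score-weighted coverage (illustrative only).

    features:      (N, 6) array, one row [x, y, z, nx, ny, nz] per surface point.
    scores:        (N,) information score per surface point (assumed given).
    candidate_idx: indices of the points used as candidate view centers.
    num_views:     maximum number of views to select.
    radius:        hypothetical 6D coverage radius.
    """
    features = np.asarray(features, dtype=float)
    scores = np.asarray(scores, dtype=float)
    candidate_idx = np.asarray(candidate_idx)

    # covers[c, n] is True when point n lies within `radius` of candidate c in 6D.
    diff = features[candidate_idx][:, None, :] - features[None, :, :]
    covers = np.linalg.norm(diff, axis=2) <= radius

    uncovered = np.ones(len(features), dtype=bool)
    selected = []
    for _ in range(num_views):
        # Gain of a candidate: total score of points it would newly cover.
        gains = (covers & uncovered).astype(float) @ scores
        best = int(np.argmax(gains))
        if gains[best] <= 0:
            break  # nothing informative is left to cover
        selected.append(int(candidate_idx[best]))
        uncovered &= ~covers[best]
    return selected


# Toy scene: two nearby points, one distant point, and one point that shares
# a position with the first but has the opposite normal.
toy_features = [
    [0.0, 0.0, 0.0, 0.0, 0.0, 1.0],
    [0.1, 0.0, 0.0, 0.0, 0.0, 1.0],
    [5.0, 0.0, 0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0, 0.0, -1.0],
]
toy_scores = [1.0, 1.0, 3.0, 0.5]
toy_selection = greedy_view_coverage(toy_features, toy_scores, [0, 2, 3], 2, 0.5)
```

Including normals in the distance means that two points at the same location but facing opposite directions are treated as separate surface regions, which is one plausible reason to cover in 6D rather than in 3D alone.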

Main Results:

Validated the framework on ScanNet v2 and S3DIS datasets, demonstrating consistent gains over baseline models.
Achieved new state-of-the-art accuracy for joint 2D and 3D scene semantic segmentation.
The proposed method effectively improves performance for both joint 2D-3D and pure 3D input deep neural models.

Conclusions:

The proposed virtual view selection framework significantly enhances 3D scene understanding.
This approach effectively addresses the limitations of real-world 2D image data in 3D vision tasks.
The method offers a general and effective solution for improving deep learning models in 3D semantic segmentation.