Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

2.7K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

P3C-DNet: Pseudo-Groundtruth Contrastive Learning With Color Calibration Dehazing Network.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Making Machine Learning Accessible for Developmental Science: The Case of Automated Face Detection.

Developmental science·2026
Same author

Gas-Phase and Condensed-Phase Synergy in a Nonflammable Electrolyte for Highly Stable Sodium-Ion Batteries.

Journal of the American Chemical Society·2026
Same author

Spatial Multiple Importance Sampling for Real-Time Irradiance Probes.

IEEE transactions on visualization and computer graphics·2026
Same author

SeparateGen: Semantic Component-Based 3D Character Generation From Single Images.

IEEE transactions on visualization and computer graphics·2026
Same author

A Physics-Informed Demonstration-Guided Learning Framework for Granular Material Manipulation.

IEEE transactions on neural networks and learning systems·2025
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Apr 29, 2026

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data
12:08

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

Published on: August 13, 2014

24.5K

Learning Virtual View Selection for 3D Scene Semantic Segmentation.

Tai-Jiang Mu, Ming-Yuan Shen, Yu-Kun Lai

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |July 10, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new framework for 3D scene understanding by generating informative virtual 2D views. This approach enhances 3D semantic segmentation accuracy by overcoming limitations of real-world captured images.

    More Related Videos

    A Method for 3D Reconstruction and Virtual Reality Analysis of Glial and Neuronal Cells
    12:49

    A Method for 3D Reconstruction and Virtual Reality Analysis of Glial and Neuronal Cells

    Published on: September 28, 2019

    12.8K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.7K

    Related Experiment Videos

    Last Updated: Apr 29, 2026

    From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data
    12:08

    From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

    Published on: August 13, 2014

    24.5K
    A Method for 3D Reconstruction and Virtual Reality Analysis of Glial and Neuronal Cells
    12:49

    A Method for 3D Reconstruction and Virtual Reality Analysis of Glial and Neuronal Cells

    Published on: September 28, 2019

    12.8K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.7K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Joint 2D-3D learning is crucial for 3D vision tasks like semantic segmentation, leveraging complementary data.
    • Current methods using only real 2D images suffer from redundancy, occlusion, and limited fields of view, hindering performance.
    • Effective 3D scene understanding requires overcoming the limitations of standard 2D image inputs.

    Purpose of the Study:

    • To propose a general framework for joint 2D-3D scene understanding by selecting informative virtual 2D views.
    • To improve 3D semantic segmentation by integrating generated virtual views with 3D geometry data.
    • To enhance deep neural models for 3D vision tasks through a novel view selection strategy.

    Main Methods:

    • Generating virtual 2D views based on an information score map derived from 3D scene semantic segmentation results.
    • Formalizing the information score map learning as a deep reinforcement learning process with rewards for accurate predictions.
    • Employing an efficient greedy virtual view coverage strategy in 6D space (coordinates and normals) for optimal surface coverage.

    Main Results:

    • Validated the framework on ScanNet v2 and S3DIS datasets, demonstrating consistent gains over baseline models.
    • Achieved new state-of-the-art accuracy for joint 2D and 3D scene semantic segmentation.
    • The proposed method effectively improves performance for both joint 2D-3D and pure 3D input deep neural models.

    Conclusions:

    • The proposed virtual view selection framework significantly enhances 3D scene understanding.
    • This approach effectively addresses the limitations of real-world 2D image data in 3D vision tasks.
    • The method offers a general and effective solution for improving deep learning models in 3D semantic segmentation.