Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Perceptual Constancy

Perceptual Constancy

Perceptual constancy is the ability to recognize that objects remain consistent and unchanged even when their appearance varies due to changes in sensory input. There are four main types of perceptual constancy: size constancy, shape constancy, color constancy, and brightness constancy.
Size constancy is the recognition that an object remains the same size, even when its image on the retina changes. For instance, a bus is perceived to be large enough to carry people, even if it looks tiny from...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

GEOMETRY OF LONG-TAILED REPRESENTATION LEARNING: REBALANCING FEATURES FOR SKEWED DISTRIBUTIONS.

... International Conference on Learning Representations·2026

Same author

Recent Progress of Single-Ion Conducting Polymer Electrolytes for Rechargeable Mono- and Multivalent Cation-Based Metal Batteries.

Angewandte Chemie (International ed. in English)·2026

Same author

Setup-Independent Full Projector Compensation.

IEEE transactions on visualization and computer graphics·2026

Same author

DiffPC: Diffusion-Based Projector Photometric Compensation.

IEEE transactions on visualization and computer graphics·2026

Same author

Probing uric acid-related prognostic genes and their molecular mechanisms in prostate cancer based on transcriptomic data.

Discover oncology·2026

Same author

Photoredox-catalysed cyclisation of carbyne equivalents with alkenes and aldehydes to dihydrofurans.

Chemical communications (Cambridge, England)·2026

Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 12, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Weakly Supervised Monocular 3D Object Detection by Spatial-Temporal View Consistency.

Wencheng Han, Runzhou Tao, Haibin Ling

IEEE Transactions on Pattern Analysis and Machine Intelligence

|September 24, 2024

Summary

This summary is machine-generated.

This study introduces a weakly-supervised method for monocular 3D object detection using only 2D labels, bridging the gap in training data for self-driving cars. The approach enhances detector robustness by leveraging spatial and temporal view consistency, eliminating the need for LiDAR-based 3D ground truths.

More Related Videos

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Published on: December 4, 2013

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

Related Experiment Videos

Last Updated: Jun 12, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

Published on: December 4, 2013

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

Area of Science:

Computer Vision
Robotics
Artificial Intelligence

Background:

Monocular 3D object detection is vital for autonomous vehicles but suffers from a training-inference data discrepancy.
Current methods require 3D ground truths from LiDAR during training, hindering the use of real-world driving data.
This gap prevents continuous improvement of 3D object detectors in production vehicles.

Purpose of the Study:

To develop a weakly-supervised approach for monocular 3D object detection that eliminates the need for 3D ground truths.
To establish a connected data loop for continuous improvement of 3D object detection models.
To enhance the robustness and accuracy of 3D object detectors by utilizing only 2D labels.

Main Methods:

A weakly-supervised learning framework utilizing only 2D labels for training monocular 3D object detectors.
Implementation of spatial view consistency through projection and multi-view techniques for optimizing object location and size.
Leveraging temporal view consistency and introducing temporal movement consistency to address dynamic scenes and improve 3D bounding box prediction.

Main Results:

Achieved performance comparable to fully supervised methods using only 2D ground truths.
Demonstrated the effectiveness of spatial and temporal view consistency in regulating 3D bounding box predictions.
Showcased the method's utility as a pre-training strategy, yielding significant gains when fine-tuned with limited 3D data.

Conclusions:

Weakly-supervised monocular 3D object detection using 2D labels is feasible and effective.
The proposed view consistency techniques are crucial for accurate 3D perception without 3D ground truth.
This approach offers a viable solution for data-efficient training and continuous learning in autonomous driving systems.