Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

605
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
605
Perceptual Constancy01:12

Perceptual Constancy

364
Perceptual constancy is the ability to recognize that objects remain consistent and unchanged even when their appearance varies due to changes in sensory input. There are four main types of perceptual constancy: size constancy, shape constancy, color constancy, and brightness constancy.
Size constancy is the recognition that an object remains the same size, even when its image on the retina changes. For instance, a bus is perceived to be large enough to carry people, even if it looks tiny from...
364

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

GEOMETRY OF LONG-TAILED REPRESENTATION LEARNING: REBALANCING FEATURES FOR SKEWED DISTRIBUTIONS.

... International Conference on Learning Representations·2026
Same author

Recent Progress of Single-Ion Conducting Polymer Electrolytes for Rechargeable Mono- and Multivalent Cation-Based Metal Batteries.

Angewandte Chemie (International ed. in English)·2026
Same author

Setup-Independent Full Projector Compensation.

IEEE transactions on visualization and computer graphics·2026
Same author

DiffPC: Diffusion-Based Projector Photometric Compensation.

IEEE transactions on visualization and computer graphics·2026
Same author

Probing uric acid-related prognostic genes and their molecular mechanisms in prostate cancer based on transcriptomic data.

Discover oncology·2026
Same author

Photoredox-catalysed cyclisation of carbyne equivalents with alkenes and aldehydes to dihydrofurans.

Chemical communications (Cambridge, England)·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jun 12, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

485

Weakly Supervised Monocular 3D Object Detection by Spatial-Temporal View Consistency.

Wencheng Han, Runzhou Tao, Haibin Ling

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |September 24, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a weakly-supervised method for monocular 3D object detection using only 2D labels, bridging the gap in training data for self-driving cars. The approach enhances detector robustness by leveraging spatial and temporal view consistency, eliminating the need for LiDAR-based 3D ground truths.

    More Related Videos

    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues
    08:04

    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

    Published on: December 4, 2013

    4.5K
    Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects
    06:36

    Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

    Published on: October 18, 2024

    905

    Related Experiment Videos

    Last Updated: Jun 12, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    485
    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues
    08:04

    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

    Published on: December 4, 2013

    4.5K
    Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects
    06:36

    Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

    Published on: October 18, 2024

    905

    Area of Science:

    • Computer Vision
    • Robotics
    • Artificial Intelligence

    Background:

    • Monocular 3D object detection is vital for autonomous vehicles but suffers from a training-inference data discrepancy.
    • Current methods require 3D ground truths from LiDAR during training, hindering the use of real-world driving data.
    • This gap prevents continuous improvement of 3D object detectors in production vehicles.

    Purpose of the Study:

    • To develop a weakly-supervised approach for monocular 3D object detection that eliminates the need for 3D ground truths.
    • To establish a connected data loop for continuous improvement of 3D object detection models.
    • To enhance the robustness and accuracy of 3D object detectors by utilizing only 2D labels.

    Main Methods:

    • A weakly-supervised learning framework utilizing only 2D labels for training monocular 3D object detectors.
    • Implementation of spatial view consistency through projection and multi-view techniques for optimizing object location and size.
    • Leveraging temporal view consistency and introducing temporal movement consistency to address dynamic scenes and improve 3D bounding box prediction.

    Main Results:

    • Achieved performance comparable to fully supervised methods using only 2D ground truths.
    • Demonstrated the effectiveness of spatial and temporal view consistency in regulating 3D bounding box predictions.
    • Showcased the method's utility as a pre-training strategy, yielding significant gains when fine-tuned with limited 3D data.

    Conclusions:

    • Weakly-supervised monocular 3D object detection using 2D labels is feasible and effective.
    • The proposed view consistency techniques are crucial for accurate 3D perception without 3D ground truth.
    • This approach offers a viable solution for data-efficient training and continuous learning in autonomous driving systems.