Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

869
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
869
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

444
Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
444
Visual System01:26

Visual System

664
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
664

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Generalized Kullback-Leibler Divergence Loss.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

In situ macrophage CAR programming via ROS-responsive microneedles enhances efferocytosis and promotes mucosal regeneration.

Biomaterials·2026
Same author

Exosomal miR-93-5p modulates macrophage polarization to enhance prostate cancer progression.

Archives of biochemistry and biophysics·2026
Same author

Correction: DNAJC5 promotes cisplatin resistance in epithelial ovarian cancer by autophagy induced by the BiP/IRE1α/XBP1 endoplasmic reticulum stress pathway.

Scientific reports·2026
Same author

Endothelial IRE1α promotes thrombospondin-1 mRNA decay and supports metabolic stress adaptation of pancreatic islets.

Nature communications·2026
Same author

Mini-Gemini: Mining the Potential of Multi-Modality Vision Language Models.

IEEE transactions on pattern analysis and machine intelligence·2025
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Sep 2, 2025

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery
05:12

Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

Published on: August 12, 2021

2.1K

DSGN++: Exploiting Visual-Spatial Relation for Stereo-Based 3D Detectors.

Yilun Chen, Shijia Huang, Shu Liu

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |August 8, 2022
    PubMed
    Summary
    This summary is machine-generated.

    DSGN++ enhances camera-based 3D object detection by improving 2D-to-3D information flow. This advanced stereo detector achieves superior performance on the KITTI benchmark.

    More Related Videos

    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues
    08:04

    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

    Published on: December 4, 2013

    4.5K
    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
    07:45

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

    Published on: July 21, 2020

    4.5K

    Related Experiment Videos

    Last Updated: Sep 2, 2025

    Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery
    05:12

    Robotized Testing of Camera Positions to Determine Ideal Configuration for Stereo 3D Visualization of Open-Heart Surgery

    Published on: August 12, 2021

    2.1K
    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues
    08:04

    Measuring Sensitivity to Viewpoint Change with and without Stereoscopic Cues

    Published on: December 4, 2013

    4.5K
    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
    07:45

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

    Published on: July 21, 2020

    4.5K

    Area of Science:

    • Computer Vision
    • Robotics
    • Machine Learning

    Background:

    • Camera-based 3D object detection offers a cost-effective alternative to LiDAR sensors.
    • Existing stereo detectors like DSGN have limitations in representing 3D geometry and semantics effectively.
    • Improving the 2D-to-3D information pipeline is crucial for enhancing detection accuracy.

    Purpose of the Study:

    • To propose an advanced stereo detector, DSGN++, that improves upon the DSGN model.
    • To enhance the effective information flow throughout the 2D-to-3D detection pipeline.
    • To achieve state-of-the-art performance in camera-based 3D object detection.

    Main Methods:

    • Depth-wise Plane Sweeping (DPS) for lifting 2D information to stereo volumes with denser connections and depth-guided features.
    • Dual-view Stereo Volume (DSV) integrating front and top views for improved feature grasping and sub-voxel depth reconstruction.
    • Stereo-LiDAR Copy-Paste strategy for enhanced data efficiency and cross-modal alignment, addressing foreground region dominance.

    Main Results:

    • DSGN++ consistently outperforms existing camera-based 3D object detectors on the KITTI benchmark across all categories.
    • The proposed methods (DPS, DSV, Stereo-LiDAR Copy-Paste) contribute significantly to the performance gains.
    • The approach demonstrates effectiveness across various modality setups.

    Conclusions:

    • DSGN++ represents a significant advancement in camera-based 3D object detection.
    • The proposed techniques effectively address key challenges in stereo volume construction and data augmentation.
    • The method offers a robust and high-performing solution for 3D object detection using only cameras.