Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

668
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
668
Visual System01:26

Visual System

585
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
585
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

4.2K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
4.2K
Vision01:24

Vision

53.5K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
53.5K
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

406
Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
406
Three-Dimensional Force System:Problem Solving01:30

Three-Dimensional Force System:Problem Solving

669
A three-dimensional force system refers to a scenario in which three forces act simultaneously in three different directions. This type of problem is commonly encountered in physics and engineering, where it is necessary to calculate the resultant force on the system, which can then be used to predict or analyze the behavior of the object or structure under consideration.
To solve a three-dimensional force system, first resolve each force into its respective scalar components. Do this using...
669

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

MVHumanNet++: A Large-scale Dataset of Multi-view Daily Dressing Human Captures with Richer Annotations for 3D Human Digitization.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

CloseUpShot: Close-Up Novel View Synthesis From Sparse-Views via Point-Conditioned Diffusion Model.

IEEE transactions on visualization and computer graphics·2025
Same author

From pretraining to privacy: federated ultrasound foundation model with self-supervised learning.

NPJ digital medicine·2025
Same author

A generalist foundation model and database for open-world medical image segmentation.

Nature biomedical engineering·2025
Same author

Boost Protein Language Model with Injected Structure Information Through Parameter Efficient Fine-tuning.

Computers in biology and medicine·2025
Same author

Reinforcement Learning With LLMs Interaction for Distributed Diffusion Model Services.

IEEE transactions on pattern analysis and machine intelligence·2025
Same journal

Blue Noise Dithering for Reservoir-based Spatio-temporal Importance Resampling.

IEEE transactions on visualization and computer graphics·2026
Same journal

ROS-GS: Relightable Outdoor Scenes With Gaussian Splatting.

IEEE transactions on visualization and computer graphics·2026
Same journal

MesoSplats: Texture Synthesis with Gaussian Splatting.

IEEE transactions on visualization and computer graphics·2026
Same journal

GLLA: A Unified Force-Directed Graph Layout Framework Supporting Local Adjustments.

IEEE transactions on visualization and computer graphics·2026
Same journal

Multi-Perception Crowd: Learning to combine entity and implicit perception for diverse crowd simulation.

IEEE transactions on visualization and computer graphics·2026
Same journal

Hiding in Plain Sight: Camouflaging Real-world Objects.

IEEE transactions on visualization and computer graphics·2026
See all related articles

Related Experiment Video

Updated: Jul 9, 2025

Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging
09:19

Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging

Published on: April 18, 2025

489

Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation.

Xu Yan, Zhihao Yuan, Yuhao Du

    IEEE Transactions on Visualization and Computer Graphics
    |December 8, 2023
    PubMed
    Summary
    This summary is machine-generated.

    We introduce CLEVR3D, a large dataset for Visual Question Answering on 3D Point Clouds (VQA-3D). This dataset enhances 3D scene understanding by generating diverse questions about object attributes and spatial relationships.

    More Related Videos

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.0K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.8K

    Related Experiment Videos

    Last Updated: Jul 9, 2025

    Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging
    09:19

    Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging

    Published on: April 18, 2025

    489
    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.0K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.8K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • 3D Scene Understanding

    Background:

    • Visual Question Answering on 3D Point Clouds (VQA-3D) is an emerging challenge.
    • Existing datasets may lack diversity and real-world complexity for robust VQA-3D models.

    Purpose of the Study:

    • To introduce CLEVR3D, a large-scale dataset for VQA-3D.
    • To facilitate comprehensive visual understanding in 3D scenes by addressing confounding biases and common-sense context.
    • To improve the performance of VQA-3D models on real-world comprehension tasks.

    Main Methods:

    • Developed a question engine using 3D scene graphs to generate diverse reasoning questions on object attributes and spatial relationships.
    • Created an initial set of 44K questions from 1,333 real-world 3D scenes.
    • Introduced compositional scene manipulation to generate 127K questions from 7,438 augmented 3D scenes, addressing confounding biases and common-sense context.

    Main Results:

    • The CLEVR3D dataset comprises 171K questions across 8,771 3D scenes.
    • The proposed challenging setup and augmented scenes improve VQA-3D model comprehension.
    • Baselines demonstrate that CLEVR3D significantly boosts performance on 3D scene understanding tasks.

    Conclusions:

    • CLEVR3D is a valuable resource for advancing VQA-3D research.
    • The dataset's diverse and challenging nature pushes the boundaries of 3D visual understanding.
    • Future VQA-3D models trained on CLEVR3D are expected to exhibit enhanced real-world applicability.