Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.5K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.5K
Vision01:24

Vision

58.9K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
58.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A Guide to Structureless Visual Localization.

International journal of computer vision·2026
Same author

Generative chromoendoscopy for gastric neoplasms: a multicenter prospective study.

Endoscopy·2026
Same author

Taurine inhibits apolipoprotein E4 aggregation.

Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie·2026
Same author

Markerless inside-out tool tracking for endoscopic spine surgery: a benchmarking study and clinical dataset.

International journal of computer assisted radiology and surgery·2026
Same author

Training-free detection and 6D pose estimation of unseen surgical instruments.

International journal of computer assisted radiology and surgery·2026
Same author

RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems.

Sensors (Basel, Switzerland)·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Dec 5, 2025

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.5K

Long-Term Visual Localization Revisited.

Carl Toft, Will Maddern, Akihiko Torii

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |October 19, 2020
    PubMed
    Summary
    This summary is machine-generated.

    This study enhances visual localization datasets with ground truth poses for robust autonomous navigation and augmented reality. The curated datasets and benchmarking server aim to advance research in long-term visual localization and learned image features.

    More Related Videos

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
    07:12

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

    Published on: April 11, 2025

    759
    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
    07:45

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

    Published on: July 21, 2020

    4.8K

    Related Experiment Videos

    Last Updated: Dec 5, 2025

    Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
    09:27

    Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

    Published on: October 13, 2018

    10.5K
    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
    07:12

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

    Published on: April 11, 2025

    759
    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
    07:45

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

    Published on: July 21, 2020

    4.8K

    Area of Science:

    • Computer Vision
    • Robotics
    • Geographic Information Systems

    Background:

    • Visual localization is crucial for autonomous vehicles and augmented reality, requiring robustness to diverse environmental conditions.
    • Existing datasets often lack comprehensive ground truth pose information, hindering accurate evaluation of localization methods.
    • Variations in lighting, weather, and seasons significantly challenge the performance of six degree-of-freedom (6DOF) camera pose estimation.

    Purpose of the Study:

    • To extend existing visual localization datasets with accurate ground truth pose information.
    • To facilitate the evaluation of visual localization techniques under challenging, real-world viewing conditions.
    • To stimulate research in long-term visual localization and learned local image features.

    Main Methods:

    • Augmenting three publicly available datasets with precise 6DOF camera pose data.
    • Establishing a private test set from the extended datasets for unbiased benchmarking.
    • Analyzing the performance of state-of-the-art visual localization approaches on the enhanced datasets.

    Main Results:

    • The extended datasets enable detailed analysis of factors impacting 6DOF camera pose estimation accuracy.
    • State-of-the-art localization methods were evaluated, with results largely informed by server submissions.
    • A significant portion of ground truth poses were released, with a held-out test set for future research.

    Conclusions:

    • The enhanced datasets provide a valuable resource for advancing robust visual localization.
    • The benchmarking server and released data will foster progress in long-term visual localization and related fields.
    • Accurate pose estimation under varied conditions remains a critical area for autonomous systems and AR.