Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device

Surveyors use Global Positioning System (GPS) technology to measure the precise location and elevation of points on Earth. In a recent survey, GPS receivers were used to determine the coordinates and elevations of two park monuments. The process involved careful mission planning, data collection, and correction to ensure accuracy. The survey began with mission planning to identify optimal satellite visibility and minimize Position Dilution of Precision (PDOP). A geodetic control point...

Design Example: Alignment of a Road Line Using GIS

Design Example: Alignment of a Road Line Using GIS

The alignment of a road line using Geographic Information Systems (GIS) is a critical process in civil engineering, combining advanced technology with practical decision-making. This methodology begins with the collection of geospatial data, including information on land cover, geomorphology, drainage patterns, slope, and contour details. Such data is typically acquired through satellite imagery and GIS tools, offering a comprehensive understanding of the terrain.Once the data is gathered, it...

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Sight Distance in a Vertical Curve

Sight Distance in a Vertical Curve

Sight distance on vertical curves is critical in roadway design. It ensures drivers can see far enough ahead to identify and respond to hazards effectively. This directly impacts safety, driver comfort, and the overall efficiency of the transportation network.Vertical curves are classified into crest and sag curves based on their geometry. For crest curves, sight distance is determined by the line of sight between a driver's eye and a small object on the road's surface. Design parameters for...

The Anchoring-and-Adjustment Heuristic

The Anchoring-and-Adjustment Heuristic

In order to make good decisions, we use our knowledge and our reasoning. Often, this knowledge and reasoning is sound and solid. However, sometimes, we are swayed by biases or by others manipulating a situation. For example, let’s say you and three friends wanted to rent a house and had a combined target budget of $1,600. The realtor shows you only very run-down houses for $1,600 and then shows you a very nice house for $2,000. Might you ask each person to pay more in rent to get the...

Visual Agnosia

Visual Agnosia

Visual agnosia is a condition characterized by the inability to recognize visually presented objects despite having normal vision. For instance, a person with visual agnosia can describe the shape and color of an object but cannot identify or name it. This impairment does not affect their visual field, acuity, color vision, brightness discrimination, language, or memory. An example of this condition in a social setting is someone at a dinner party asking for "that silver thing with a round...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

DynamicVLN: Incorporating Dynamics into Vision-and-Language Navigation Scenarios.

Sensors (Basel, Switzerland)·2025

Same author

A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans.

Sensors (Basel, Switzerland)·2025

Same author

RetinaViT: Efficient Visual Backbone for Online Video Streams.

Sensors (Basel, Switzerland)·2024

Same author

Proto-Adapter: Efficient Training-Free CLIP-Adapter for Few-Shot Image Classification.

Sensors (Basel, Switzerland)·2024

Same author

Synthetic Document Images with Diverse Shadows for Deep Shadow Removal Networks.

Sensors (Basel, Switzerland)·2024

Same author

Action Quality Assessment Model Using Specialists' Gaze Location and Kinematics Data-Focusing on Evaluating Figure Skating Jumps.

Sensors (Basel, Switzerland)·2023

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 23, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Outdoor Vision-and-Language Navigation Needs Object-Level Alignment.

Yanjun Sun^1,2, Yue Qiu², Yoshimitsu Aoki¹

¹Department of Electronics and Electrical Engineering, Faculty of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan.

Sensors (Basel, Switzerland)

|July 14, 2023

Summary

This summary is machine-generated.

This study introduces an object-level alignment module (OAlM) for embodied AI agents. The OAlM improves outdoor vision-and-language navigation (VLN) by using landmarks as sub-goals, enhancing performance on complex datasets.

Keywords:

embodied AI landmark-based navigation vision-and-language navigation

More Related Videos

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Published on: March 27, 2013

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

Related Experiment Videos

Last Updated: Jul 23, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Published on: March 27, 2013

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

Area of Science:

Embodied Artificial Intelligence (AI)
Computer Vision
Natural Language Processing

Background:

Vision-and-Language Navigation (VLN) is a complex multi-modal task in embodied AI, particularly challenging in outdoor urban environments.
Existing outdoor VLN models often struggle with complex environments and detailed instruction interpretation, leading to navigation failures.
Human navigation effectively utilizes landmarks as reference points for more efficient pathfinding.

Purpose of the Study:

To enhance outdoor vision-and-language navigation (VLN) performance in embodied AI.
To address the limitations of current models in interpreting complex environments and instructions.
To develop a more robust and adaptable navigation agent inspired by human landmark-based navigation.

Main Methods:

Proposed an object-level alignment module (OAlM) to guide agents towards recognizing and utilizing instruction-specified objects as landmarks.
Treated recognized landmarks as intermediate sub-goals to decompose long-range navigation paths into shorter, manageable segments.
Integrated the OAlM with existing panorama and instruction feature-based VLN models.

Main Results:

The OAlM demonstrated a more object-focused navigation strategy compared to baseline models.
The proposed approach significantly outperformed the baseline on the challenging outdoor VLN Touchdown dataset.
Achieved a 3.19% improvement in task completion (TC) rate, indicating enhanced navigation efficiency and success.

Conclusions:

Leveraging object-level information and landmarks as sub-goals is a promising strategy for improving embodied AI navigation.
The OAlM enhances agent adaptability and robustness in complex, unpredictable real-world environments.
This research paves the way for more advanced and efficient outdoor navigation systems in embodied AI.