Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Absolute Motion Analysis- General Plane Motion

Absolute Motion Analysis- General Plane Motion

Visualize a drone, with its propellers spinning rapidly, hovering mid-air. The fascinating movements and operations of this drone can be comprehended by applying the principle of general plane motion.
As the drone's propellers rotate, an upward force is generated that counteracts the force of gravity, enabling the drone to lift off from the ground. This initial movement of the drone is along a straight path, representing a form of translational motion. In this phase, every point on the drone...

Vector Functions and Motion: Problem Solving

Vector Functions and Motion: Problem Solving

Accurate position tracking is fundamental to the safe and effective operation of unmanned aerial vehicles (UAVs), particularly during precision maneuvers near complex structures. In this scenario, a drone is programmed to perform a high-precision inspection of a vertical structure, starting at position ((x, y, z) = (3, 0, 0)), with an initial velocity oriented in the positive z-direction. The trajectory of the drone is governed by a time-dependent acceleration function a(t), which is predefined...

Planar Rigid-Body Motion

Planar Rigid-Body Motion

Understanding the movement of a rigid body in planar motion involves recognizing that every particle within this body is traversing a path that maintains a consistent distance from a specific plane. This concept is fundamental in the study of physics and mechanical engineering, and it allows us to comprehend better how objects move in space.
Planar motion is typically divided into three distinct categories. The first is rectilinear translation, demonstrated by a subway train that moves along...

Relative Motion Analysis using Rotating Axes-Problem Solving

Relative Motion Analysis using Rotating Axes-Problem Solving

Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...

Relative Motion Analysis using Rotating Axes

Relative Motion Analysis using Rotating Axes

Consider a component AB undergoing a linear motion. Along with a linear motion, point B also rotates around point A. To comprehend this complex movement, position vectors for both points A and B are established using a stationary reference frame.
However, to express the relative position of point B relative to point A, an additional frame of reference, denoted as x'y', is necessary. This additional frame not only translates but also rotates relative to the fixed frame, making it instrumental in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

HisCMCL: Cross-Modal Contrastive Learning with Hierarchical Multi-Scale Fusion for Spatial Expression Prediction.

Bioinformatics (Oxford, England)·2026

Same author

MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

YTHDF1 mediates KLF2/VSIG4 axis to regulate Kupffer cell polarization to alleviate sepsis-induced liver injury.

Genes and immunity·2025

Same author

Effects of Stevia Straw Supplementation on Meat Quality, Nutrient Composition, and Rumen Microbiota in Sheep.

Veterinary sciences·2025

Same author

Effects of dietary supplementation with hydroponic wheat seedlings on rumen fermentation, meat quality, amino acid and fatty acid contents, and rumen bacterial diversity in sheep.

Frontiers in microbiology·2025

Same author

In Situ Chemical Construction of Ultrathin Zn<sup>2+</sup>-Conductive Interphase for Dendrite-Free Zinc Metal Batteries.

Angewandte Chemie (International ed. in English)·2025

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 27, 2026

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise

Published on: January 26, 2024

STAMP: Spatial-Temporal Anchored Motion Planning for Zero-Shot Continuous Vision-and-Language Navigation.

Tai Liu¹, Xiaoyan Qi², Liuyi Wang²

¹CRRC Qingdao Sifang Co., Ltd., No. 88 Jinhong East Road, Chengyang District, Qingdao 266111, China.

Sensors (Basel, Switzerland)

|June 26, 2026

Summary

This summary is machine-generated.

We introduce STAMP, a Spatial-Temporal Anchored Motion Planning framework to improve vision-and-language navigation (VLN) for embodied agents. STAMP enhances zero-shot navigation by grounding language instructions with spatial and temporal context.

Keywords:

embodied AI vision-and-language navigation visual navigation

Related Experiment Videos

Last Updated: Jun 27, 2026

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise

Published on: January 26, 2024

Area of Science:

Robotics
Artificial Intelligence
Computer Vision

Background:

Vision-and-Language Navigation in continuous environments (VLN-CE) demands agents ground language into motion decisions under partial observability.
Multimodal Large Language Models (LVLMs) excel at semantic understanding but struggle with spatial grounding, embodied memory, and geometric constraints in VLN.

Purpose of the Study:

To propose STAMP, a Spatial-Temporal Anchored Motion Planning framework, to bridge the gap between pretrained world knowledge and embodied navigation for zero-shot VLN-CE.
To enhance LVLM performance in VLN by addressing limitations in spatial grounding and decision-making.

Main Methods:

STAMP employs a hierarchical design decoupling semantic reasoning from motion execution, utilizing a frozen LVLM over a structured abstraction.
A novel multimodal spatial-temporal anchoring mechanism encodes landmarks, action semantics, depth geometry, and navigation history.
An explicit Chain-of-Navigation reasoning process and an online, backtracking-enabled topological map support robust planning.

Main Results:

STAMP achieves performance comparable to state-of-the-art zero-shot methods on VLN-CE benchmarks.
The framework demonstrates effectiveness in both simulated and real-world navigation tasks.
STAMP successfully grounds natural language instructions into reliable long-horizon motion decisions.

Conclusions:

STAMP effectively enhances zero-shot vision-and-language navigation by integrating spatial-temporal reasoning and explicit planning mechanisms.
The proposed framework offers a systematic approach to improve embodied agent navigation in complex, continuous environments.
STAMP provides a robust solution for VLN-CE by addressing key challenges in perception, grounding, and decision-making.