Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
Absolute Motion Analysis- General Plane Motion01:24

Absolute Motion Analysis- General Plane Motion

Visualize a drone, with its propellers spinning rapidly, hovering mid-air. The fascinating movements and operations of this drone can be comprehended by applying the principle of general plane motion.
As the drone's propellers rotate, an upward force is generated that counteracts the force of gravity, enabling the drone to lift off from the ground. This initial movement of the drone is along a straight path, representing a form of translational motion. In this phase, every point on the drone...
Vector Functions and Motion: Problem Solving01:30

Vector Functions and Motion: Problem Solving

Accurate position tracking is fundamental to the safe and effective operation of unmanned aerial vehicles (UAVs), particularly during precision maneuvers near complex structures. In this scenario, a drone is programmed to perform a high-precision inspection of a vertical structure, starting at position ((x, y, z) = (3, 0, 0)), with an initial velocity oriented in the positive z-direction. The trajectory of the drone is governed by a time-dependent acceleration function a(t), which is predefined...
Planar Rigid-Body Motion01:22

Planar Rigid-Body Motion

Understanding the movement of a rigid body in planar motion involves recognizing that every particle within this body is traversing a path that maintains a consistent distance from a specific plane. This concept is fundamental in the study of physics and mechanical engineering, and it allows us to comprehend better how objects move in space.
Planar motion is typically divided into three distinct categories. The first is rectilinear translation, demonstrated by a subway train that moves along...
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
Relative Motion Analysis using Rotating Axes01:25

Relative Motion Analysis using Rotating Axes

Consider a component AB undergoing a linear motion. Along with a linear motion, point B also rotates around point A. To comprehend this complex movement, position vectors for both points A and B are established using a stationary reference frame.
However, to express the relative position of point B relative to point A, an additional frame of reference, denoted as x'y', is necessary. This additional frame not only translates but also rotates relative to the fixed frame, making it instrumental in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

HisCMCL: Cross-Modal Contrastive Learning with Hierarchical Multi-Scale Fusion for Spatial Expression Prediction.

Bioinformatics (Oxford, England)ยท2026
Same author

MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation.

IEEE transactions on pattern analysis and machine intelligenceยท2026
Same author

YTHDF1 mediates KLF2/VSIG4 axis to regulate Kupffer cell polarization to alleviate sepsis-induced liver injury.

Genes and immunityยท2025
Same author

Effects of Stevia Straw Supplementation on Meat Quality, Nutrient Composition, and Rumen Microbiota in Sheep.

Veterinary sciencesยท2025
Same author

Effects of dietary supplementation with hydroponic wheat seedlings on rumen fermentation, meat quality, amino acid and fatty acid contents, and rumen bacterial diversity in sheep.

Frontiers in microbiologyยท2025
Same author

In Situ Chemical Construction of Ultrathin Zn<sup>2+</sup>-Conductive Interphase for Dendrite-Free Zinc Metal Batteries.

Angewandte Chemie (International ed. in English)ยท2025
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)ยท2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)ยท2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)ยท2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)ยท2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)ยท2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)ยท2026
See all related articles

Related Experiment Video

Updated: Jun 27, 2026

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise
06:17

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise

Published on: January 26, 2024

STAMP: Spatial-Temporal Anchored Motion Planning for Zero-Shot Continuous Vision-and-Language Navigation.

Tai Liu1, Xiaoyan Qi2, Liuyi Wang2

  • 1CRRC Qingdao Sifang Co., Ltd., No. 88 Jinhong East Road, Chengyang District, Qingdao 266111, China.

Sensors (Basel, Switzerland)
|June 26, 2026
PubMed
Summary
This summary is machine-generated.

We introduce STAMP, a Spatial-Temporal Anchored Motion Planning framework to improve vision-and-language navigation (VLN) for embodied agents. STAMP enhances zero-shot navigation by grounding language instructions with spatial and temporal context.

Keywords:
embodied AIvision-and-language navigationvisual navigation

Related Experiment Videos

Last Updated: Jun 27, 2026

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise
06:17

Assessing Human Spatial Navigation in a Virtual Space and its Sensitivity to Exercise

Published on: January 26, 2024

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Computer Vision

Background:

  • Vision-and-Language Navigation in continuous environments (VLN-CE) demands agents ground language into motion decisions under partial observability.
  • Multimodal Large Language Models (LVLMs) excel at semantic understanding but struggle with spatial grounding, embodied memory, and geometric constraints in VLN.

Purpose of the Study:

  • To propose STAMP, a Spatial-Temporal Anchored Motion Planning framework, to bridge the gap between pretrained world knowledge and embodied navigation for zero-shot VLN-CE.
  • To enhance LVLM performance in VLN by addressing limitations in spatial grounding and decision-making.

Main Methods:

  • STAMP employs a hierarchical design decoupling semantic reasoning from motion execution, utilizing a frozen LVLM over a structured abstraction.
  • A novel multimodal spatial-temporal anchoring mechanism encodes landmarks, action semantics, depth geometry, and navigation history.
  • An explicit Chain-of-Navigation reasoning process and an online, backtracking-enabled topological map support robust planning.

Main Results:

  • STAMP achieves performance comparable to state-of-the-art zero-shot methods on VLN-CE benchmarks.
  • The framework demonstrates effectiveness in both simulated and real-world navigation tasks.
  • STAMP successfully grounds natural language instructions into reliable long-horizon motion decisions.

Conclusions:

  • STAMP effectively enhances zero-shot vision-and-language navigation by integrating spatial-temporal reasoning and explicit planning mechanisms.
  • The proposed framework offers a systematic approach to improve embodied agent navigation in complex, continuous environments.
  • STAMP provides a robust solution for VLN-CE by addressing key challenges in perception, grounding, and decision-making.