Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device

Surveyors use Global Positioning System (GPS) technology to measure the precise location and elevation of points on Earth. In a recent survey, GPS receivers were used to determine the coordinates and elevations of two park monuments. The process involved careful mission planning, data collection, and correction to ensure accuracy. The survey began with mission planning to identify optimal satellite visibility and minimize Position Dilution of Precision (PDOP). A geodetic control point...

Design Example: Alignment of a Road Line Using GIS

Design Example: Alignment of a Road Line Using GIS

The alignment of a road line using Geographic Information Systems (GIS) is a critical process in civil engineering, combining advanced technology with practical decision-making. This methodology begins with the collection of geospatial data, including information on land cover, geomorphology, drainage patterns, slope, and contour details. Such data is typically acquired through satellite imagery and GIS tools, offering a comprehensive understanding of the terrain.Once the data is gathered, it...

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Sight Distance in a Vertical Curve

Sight Distance in a Vertical Curve

Sight distance on vertical curves is critical in roadway design. It ensures drivers can see far enough ahead to identify and respond to hazards effectively. This directly impacts safety, driver comfort, and the overall efficiency of the transportation network.Vertical curves are classified into crest and sag curves based on their geometry. For crest curves, sight distance is determined by the line of sight between a driver's eye and a small object on the road's surface. Design parameters for...

The Anchoring-and-Adjustment Heuristic

The Anchoring-and-Adjustment Heuristic

In order to make good decisions, we use our knowledge and our reasoning. Often, this knowledge and reasoning is sound and solid. However, sometimes, we are swayed by biases or by others manipulating a situation. For example, let’s say you and three friends wanted to rent a house and had a combined target budget of $1,600. The realtor shows you only very run-down houses for $1,600 and then shows you a very nice house for $2,000. Might you ask each person to pay more in rent to get the...

Visual Agnosia

Visual Agnosia

Visual agnosia is a condition characterized by the inability to recognize visually presented objects despite having normal vision. For instance, a person with visual agnosia can describe the shape and color of an object but cannot identify or name it. This impairment does not affect their visual field, acuity, color vision, brightness discrimination, language, or memory. An example of this condition in a social setting is someone at a dinner party asking for "that silver thing with a round...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

DynamicVLN: Incorporating Dynamics into Vision-and-Language Navigation Scenarios.

Sensors (Basel, Switzerland)·2025

Same author

A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans.

Sensors (Basel, Switzerland)·2025

Same author

RetinaViT: Efficient Visual Backbone for Online Video Streams.

Sensors (Basel, Switzerland)·2024

Same author

Proto-Adapter: Efficient Training-Free CLIP-Adapter for Few-Shot Image Classification.

Sensors (Basel, Switzerland)·2024

Same author

Synthetic Document Images with Diverse Shadows for Deep Shadow Removal Networks.

Sensors (Basel, Switzerland)·2024

Same author

Action Quality Assessment Model Using Specialists' Gaze Location and Kinematics Data-Focusing on Evaluating Figure Skating Jumps.

Sensors (Basel, Switzerland)·2023

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jul 23, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

户外视觉和语言导航需要对象水平对齐.

Yanjun Sun^1,2, Yue Qiu², Yoshimitsu Aoki¹

¹Department of Electronics and Electrical Engineering, Faculty of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan.

Sensors (Basel, Switzerland)

|July 14, 2023

概括

此摘要是机器生成的。

本研究介绍了一个对体AI代理的对象级对齐模块 (OAlM). OAlM通过使用地标作为子目标来改善户外视觉和语言导航 (VLN),提高复杂数据集的性能.

关键词:

具体化的人工智能基于地标的导航.视觉和语言导航导航

更多相关视频

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Published on: March 27, 2013

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

相关实验视频

Last Updated: Jul 23, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Published on: March 27, 2013

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

科学领域:

嵌入式的人工智能 (AI)
计算机视觉计算机视觉
自然语言处理自然语言处理.

背景情况:

视觉和语言导航 (VLN) 是体现人工智能的复杂多模式任务,在户外城市环境中尤其具有挑战性.
现有的户外VLN模型经常与复杂的环境和详细的指令解释作斗争,导致导航失败.
人类导航有效地利用地标作为参考点,以更有效地寻找路径.

研究的目的:

为了提高室外视觉和语言导航 (VLN) 在嵌入式AI中的性能.
解决当前模型在解释复杂环境和指令方面的局限性.
开发一种更强大,更适应性的导航代理,其灵感来源于基于人类地标的导航.

主要方法:

提出了一个对象级对齐模块 (OAlM),以指导代理人识别和使用指令指定的对象作为地标.
将公认的地标视为中间子目标,以将远程导航路径分解为更短,更容易管理的段落.
整合了OAlM与现有的全景和指令基于特征的VLN模型.

主要成果:

与基线模型相比,OAlM展示了更以对象为中心的导航策略.
拟议的方法显著超过了具有挑战性的户外VLN触摸数据集的基线.
在任务完成率 (TC) 中实现了3.19%的改善,这表明导航效率和成功得到了提高.

结论:

作为子目标,利用对象级信息和地标是一个有前途的策略,用于改进嵌入式AI导航.
在复杂,不可预测的现实环境中,OAlM提高了代理商的适应性和稳定性.
这项研究为更先进,更高效的室外导航系统铺平了道路.