Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device01:30

Design Example: Identifying the Locations of Monuments in the Field Using Global Positioning System Device

82
Surveyors use Global Positioning System (GPS) technology to measure the precise location and elevation of points on Earth. In a recent survey, GPS receivers were used to determine the coordinates and elevations of two park monuments. The process involved careful mission planning, data collection, and correction to ensure accuracy. The survey began with mission planning to identify optimal satellite visibility and minimize Position Dilution of Precision (PDOP). A geodetic control point...
82
Design Example: Alignment of a Road Line Using GIS01:17

Design Example: Alignment of a Road Line Using GIS

72
The alignment of a road line using Geographic Information Systems (GIS) is a critical process in civil engineering, combining advanced technology with practical decision-making. This methodology begins with the collection of geospatial data, including information on land cover, geomorphology, drainage patterns, slope, and contour details. Such data is typically acquired through satellite imagery and GIS tools, offering a comprehensive understanding of the terrain.Once the data is gathered, it...
72
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

735
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
735
Sight Distance in a Vertical Curve01:29

Sight Distance in a Vertical Curve

80
Sight distance on vertical curves is critical in roadway design. It ensures drivers can see far enough ahead to identify and respond to hazards effectively. This directly impacts safety, driver comfort, and the overall efficiency of the transportation network.Vertical curves are classified into crest and sag curves based on their geometry. For crest curves, sight distance is determined by the line of sight between a driver's eye and a small object on the road's surface. Design parameters for...
80
The Anchoring-and-Adjustment Heuristic01:25

The Anchoring-and-Adjustment Heuristic

7.3K
In order to make good decisions, we use our knowledge and our reasoning. Often, this knowledge and reasoning is sound and solid. However, sometimes, we are swayed by biases or by others manipulating a situation. For example, let’s say you and three friends wanted to rent a house and had a combined target budget of $1,600. The realtor shows you only very run-down houses for $1,600 and then shows you a very nice house for $2,000. Might you ask each person to pay more in rent to get the...
7.3K
Visual Agnosia01:12

Visual Agnosia

241
Visual agnosia is a condition characterized by the inability to recognize visually presented objects despite having normal vision. For instance, a person with visual agnosia can describe the shape and color of an object but cannot identify or name it. This impairment does not affect their visual field, acuity, color vision, brightness discrimination, language, or memory. An example of this condition in a social setting is someone at a dinner party asking for "that silver thing with a round...
241

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

DynamicVLN: Incorporating Dynamics into Vision-and-Language Navigation Scenarios.

Sensors (Basel, Switzerland)·2025
Same author

A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans.

Sensors (Basel, Switzerland)·2025
Same author

RetinaViT: Efficient Visual Backbone for Online Video Streams.

Sensors (Basel, Switzerland)·2024
Same author

Proto-Adapter: Efficient Training-Free CLIP-Adapter for Few-Shot Image Classification.

Sensors (Basel, Switzerland)·2024
Same author

Synthetic Document Images with Diverse Shadows for Deep Shadow Removal Networks.

Sensors (Basel, Switzerland)·2024
Same author

Action Quality Assessment Model Using Specialists' Gaze Location and Kinematics Data-Focusing on Evaluating Figure Skating Jumps.

Sensors (Basel, Switzerland)·2023
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
查看所有相关文章

相关实验视频

Updated: Jul 23, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K

户外视觉和语言导航需要对象水平对齐.

Yanjun Sun1,2, Yue Qiu2, Yoshimitsu Aoki1

  • 1Department of Electronics and Electrical Engineering, Faculty of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan.

Sensors (Basel, Switzerland)
|July 14, 2023
PubMed
概括
此摘要是机器生成的。

本研究介绍了一个对体AI代理的对象级对齐模块 (OAlM). OAlM通过使用地标作为子目标来改善户外视觉和语言导航 (VLN),提高复杂数据集的性能.

关键词:
具体化的人工智能基于地标的导航.视觉和语言导航导航

更多相关视频

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind
09:01

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Published on: March 27, 2013

14.4K
Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function
06:17

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

2.0K

相关实验视频

Last Updated: Jul 23, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K
Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind
09:01

Development of an Audio-based Virtual Gaming Environment to Assist with Navigation Skills in the Blind

Published on: March 27, 2013

14.4K
Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function
06:17

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

2.0K

科学领域:

  • 嵌入式的人工智能 (AI)
  • 计算机视觉 计算机视觉
  • 自然语言处理自然语言处理.

背景情况:

  • 视觉和语言导航 (VLN) 是体现人工智能的复杂多模式任务,在户外城市环境中尤其具有挑战性.
  • 现有的户外VLN模型经常与复杂的环境和详细的指令解释作斗争,导致导航失败.
  • 人类导航有效地利用地标作为参考点,以更有效地寻找路径.

研究的目的:

  • 为了提高室外视觉和语言导航 (VLN) 在嵌入式AI中的性能.
  • 解决当前模型在解释复杂环境和指令方面的局限性.
  • 开发一种更强大,更适应性的导航代理,其灵感来源于基于人类地标的导航.

主要方法:

  • 提出了一个对象级对齐模块 (OAlM),以指导代理人识别和使用指令指定的对象作为地标.
  • 将公认的地标视为中间子目标,以将远程导航路径分解为更短,更容易管理的段落.
  • 整合了OAlM与现有的全景和指令基于特征的VLN模型.

主要成果:

  • 与基线模型相比,OAlM展示了更以对象为中心的导航策略.
  • 拟议的方法显著超过了具有挑战性的户外VLN触摸数据集的基线.
  • 在任务完成率 (TC) 中实现了3.19%的改善,这表明导航效率和成功得到了提高.

结论:

  • 作为子目标,利用对象级信息和地标是一个有前途的策略,用于改进嵌入式AI导航.
  • 在复杂,不可预测的现实环境中,OAlM提高了代理商的适应性和稳定性.
  • 这项研究为更先进,更高效的室外导航系统铺平了道路.