Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Schemas01:42

Schemas

11.5K
A schema is a mental construct consisting of a cluster or collection of related concepts (Bartlett, 1932). There are many different types of schemata, and they all have one thing in common: schemata are a method of organizing information that allows the brain to work more efficiently. When a schema is activated, the brain makes immediate assumptions about the person or object being observed.
11.5K
Vision01:24

Vision

52.2K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
52.2K
Storage01:23

Storage

46
A schema is a mental framework that helps individuals organize and interpret information. Schemata, formed from previous experiences, influence how we process new information: how we encode it, the inferences we make, and how we retrieve it. For instance, a schema for what a typical classroom looks like might include desks, a teacher's desk, a whiteboard, and students in such an environment. This expectation helps us quickly understand and navigate new classrooms without needing to analyze...
46

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

From metabolic to pathologic complete response: case report of Neoadjuvant Lorlatinib in Stage IIIB ALK-positive NSCLC combined with uniportal VATS.

Journal of cardiothoracic surgery·2026
Same author

LUCAT1 promotes stemness in head and neck squamous cell carcinoma by sponging miR-128.

Frontiers in oncology·2026
Same author

AAV gene therapy for homozygous familial hypercholesterolemia: a phase 1 trial.

Nature medicine·2026
Same author

Mixed-methods study on GenAI Usage, dependence behaviors, and standardized application paths among Chinese medical students.

NPJ digital medicine·2026
Same author

Chiglitazar Activates PPAR-α/γ to Suppress Oxidative Stress and Angiogenesis in Corneal Neovascularization.

Antioxidants (Basel, Switzerland)·2026
Same author

Solvent- and metal-free upcycling of low-density polyethylene using a practical ZSM-5/Al<sub>2</sub>O<sub>3</sub> bead catalyst.

Communications chemistry·2026
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
查看所有相关文章

相关实验视频

Updated: May 10, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

8.9K

以示例进行查询:使用基于LLM的场景图表表征的语义交通场景检索.

Yafu Tian1,2,3, Alexander Carballo3,4,5, Ruifeng Li2

  • 1Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan.

Sensors (Basel, Switzerland)
|April 26, 2025
PubMed
概括
此摘要是机器生成的。

本研究引入了一种新的方法,用于在自动驾驶中检索交通场景,使用视觉大语言模型 (VLM) 生成的道路场景图 (RSG). 这种方法增强了语义理解,并支持各种查询类型,以改善场景检索.

关键词:
通过示例查询查询.场景图表 场景图表亚图的同位体匹配与子图的同位体匹配.交通现场检索检索视觉法学士 (LLMs) 是一个视觉法学士.

更多相关视频

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.6K
Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

15.6K

相关实验视频

Last Updated: May 10, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

8.9K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.6K
Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

15.6K

科学领域:

  • 计算机科学 计算机科学
  • 人工智能的人工智能
  • 机器人技术 机器人技术 机器人技术

背景情况:

  • 自动驾驶系统需要从庞大的数据集中有效地检索特定的交通场景.
  • 现有的方法面临的挑战是语义复杂性和用户在交通场景数据的各种需求.
  • 自主驾驶数据的异质性和规模造成了重大检索障碍.

研究的目的:

  • 为自动驾驶提出一种新的交通场景检索方法.
  • 为了利用视觉大语言模型 (VLMs) 进行结构化的道路场景图 (RSG) 表示.
  • 为了实现交通场景的灵活和含义丰富的查询.

主要方法:

  • 利用VLMs从视频数据中生成结构化的RSG,捕获对象关系和语义属性.
  • 开发了一组可扩展的场景属性和基于图形的描述,用于量化场景相似性.
  • 创建了RSG-LLM基准数据集,包含1000个交通场景,描述和RSG用于LLM评估.

主要成果:

  • 拟议的方法有效地从大型数据库中检索语义上类似的交通场景.
  • 该方法支持各种查询格式,包括自然语言,图像,视频剪辑和rosbag文件.
  • 证明了VLM在为交通场景生成准确的RSG方面的能力.

结论:

  • 由VLM生成的RSG方法为交通场景检索提供了一个全面而灵活的框架.
  • 这种方法显著提高了自动驾驶中场景检索的效率和准确性.
  • 拟议的框架有助于在自动驾驶系统中的更广泛应用.