Robust Visual Question Answering: Datasets, Methods, and Future Challenges.

Jie Ma, Pinghui Wang, Dechen Kong

IEEE transactions on pattern analysis and machine intelligence

February 15, 2024

Summary

This summary is machine-generated.

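This survey addresses bias in visual question answering (VQA) systems, which often memorize patterns in the training data rather than genuinely understanding images. It reviews datasets, evaluation metrics, and debiasing methods aimed at improving the robustness of VQA.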

Scientific fields:

Computer science
Artificial intelligence
Machine learning

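Background:

Visual question answering (VQA) systems are challenged by dataset bias, which leads to poor out-of-distribution performance.
Existing VQA methods tend to memorize biases rather than learn grounded image understanding.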
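
The effect of memorized bias can be illustrated with a question-only baseline that never looks at the image. The following is a minimal sketch, not a method from the survey: the toy data, the question types, and the helper names `fit_prior` and `accuracy` are all hypothetical. It shows how a model that only memorizes the most frequent answer per question type can score well when the test answers follow the training distribution and poorly when they do not.

```python
from collections import Counter, defaultdict

def fit_prior(train):
    """Map each question type to its most frequent training answer."""
    counts = defaultdict(Counter)
    for qtype, answer in train:
        counts[qtype][answer] += 1
    return {qtype: c.most_common(1)[0][0] for qtype, c in counts.items()}

def accuracy(prior, data):
    """Fraction of examples whose answer equals the memorized prior answer."""
    hits = sum(prior.get(qtype) == answer for qtype, answer in data)
    return hits / len(data)

def make_split(n_white, n_red, n_yes, n_no):
    """Build a hypothetical split of (question type, ground-truth answer) pairs."""
    return (
        [("what color", "white")] * n_white
        + [("what color", "red")] * n_red
        + [("is there", "yes")] * n_yes
        + [("is there", "no")] * n_no
    )

# Toy data (assumed for illustration only); images are ignored entirely.
train = make_split(7, 3, 8, 2)
id_test = make_split(7, 3, 8, 2)    # same answer distribution as training
ood_test = make_split(2, 8, 3, 7)   # answer distribution shifted

prior = fit_prior(train)
id_acc = accuracy(prior, id_test)
ood_acc = accuracy(prior, ood_test)
print(f"in-distribution: {id_acc:.2f}, out-of-distribution: {ood_acc:.2f}")
```

The gap between the two scores comes entirely from the shift in answer frequencies, which is the kind of failure that out-of-distribution VQA benchmarks are designed to expose.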

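Purpose of the study:

Provide a comprehensive survey of datasets, evaluation metrics, and debiasing methods for VQA.
Analyze the robustness of vision-and-language pre-trained models on VQA tasks.
Identify future research directions in robust VQA.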

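Main methods:

Overview of dataset development from in-distribution and out-of-distribution perspectives.
Examination of the evaluation metrics used with VQA datasets.
Proposal of a typology of existing VQA debiasing methods, with analysis of their development, characteristics, and comparisons.
Analysis of the robustness of representative vision-and-language pre-trained models on VQA.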
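
The summary does not say which metrics the survey examines. One metric widely used with VQA benchmarks is consensus accuracy, commonly written as min(number of annotators who gave the predicted answer / 3, 1). The sketch below assumes ten human answers per question and exact string matching; the function name `vqa_accuracy` and the sample answers are hypothetical, and official evaluation scripts additionally normalize answers and average over annotator subsets.

```python
def vqa_accuracy(predicted, human_answers):
    """Simplified consensus accuracy: full credit once 3 annotators agree."""
    matches = sum(answer == predicted for answer in human_answers)
    return min(matches / 3.0, 1.0)

# Hypothetical annotations for one question (assumed, not from any dataset).
human_answers = ["red"] * 2 + ["dark red"] * 4 + ["maroon"] * 4

partial = vqa_accuracy("red", human_answers)       # 2 matching annotators
full = vqa_accuracy("dark red", human_answers)     # 4 matching annotators
miss = vqa_accuracy("blue", human_answers)         # 0 matching annotators
```

Because the score depends only on agreement with the answer strings, a model can score highly by exploiting answer priors, which is why robustness studies typically pair this metric with out-of-distribution test splits.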

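Main results:

The survey categorizes VQA datasets and evaluation metrics, highlighting their evolution and limitations.
It presents a structured typology of debiasing methods, detailing their approaches and relative robustness.
The analysis reveals the robustness characteristics of current vision-and-language pre-trained models on VQA.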

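Conclusions:

The study underscores the critical need for robust VQA systems that can overcome dataset bias.
It synthesizes current research on VQA robustness, giving researchers a foundational resource.
It identifies key areas for future research to advance reliable visual question answering.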