Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Improving Translational Accuracy

Improving Translational Accuracy

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Optimization of Process Parameters of Rhamnolipid Treatment of Oily Sludge Based on Response Surface Methodology.

ACS omega·2020

Same author

Safety and Long-term Scleral Biomechanical Stability of Rhesus Eyes after Scleral Cross-linking by Blue Light.

Current eye research·2020

Same author

Serum pentraxin 3 as a biomarker for prognosis of acute minor stroke due to large artery atherosclerosis.

Brain and behavior·2020

Same author

The roles of adenosine deaminase in autoimmune diseases.

Autoimmunity reviews·2020

Same author

The role of oxidative stress in association between disinfection by-products exposure and semen quality: A mediation analysis among men from an infertility clinic.

Chemosphere·2020

Same author

Establishment of immune prognostic signature and analysis of prospective molecular mechanisms in childhood osteosarcoma patients.

Medicine·2020

Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jul 26, 2025

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities

Published on: October 27, 2023

有效的令牌引导图像文本检索与一致的多式模式对比训练.

Chong Liu, Yuqi Zhang, Hongsong Wang

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

|June 20, 2023

概括

此摘要是机器生成的。

这项研究引入了一个统一的图像文本检索框架,结合了粗和细粒度表示. 代币引导双变压器 (TGDT) 架构提高了检索准确性和效率.

更多相关视频

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

相关实验视频

Last Updated: Jul 26, 2025

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities

Author Spotlight: An Efficient and Robust Software for Automated Fusion of Multiple Preclinical Imaging Modalities

Published on: October 27, 2023

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

科学领域:

计算机科学计算机科学
人工智能的人工智能
机器学习机器学习

背景情况:

图像-文本检索对于理解视觉和语言数据之间的语义关系至关重要.
现有的方法往往专注于全球或本地特征,忽视它们的相互作用,导致精度低于最佳,计算成本高.

研究的目的:

开发一个新的框架,整合粗细粒度表示学习,以增强图像文本检索.
为了提高检索准确度和减少多式联络理解任务中的计算复杂性.

主要方法:

提出了代币引导双变压器 (TGDT) 架构,用于图像和文本处理的两个同质分支.
引入了一致的多模态对比 (CMC) 损失,以确保在共享嵌入空间中的各种模式的语义一致性.
实施了一种使用混合全球和本地跨模式相似性的两阶段推断方法.

主要成果:

在基准数据集上实现了最先进的检索性能.
与现有的代表方法相比,证明推断时间显著降低.
统一的框架有效地利用了粗细细粒度的信息.

结论:

拟议的TGDT架构通过统一多式模式表示来提供更有效和高效的图像文本检索方法.
该CMC损失和两阶段推断方法有助于卓越的语义理解和检索准确性.
这项工作为多模式学习提供了新的视角,与人类认知过程保持一致.