Semi-supervised Text-based Person Search

Daming Gao, Yang Bai, Min Cao

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

September 16, 2025

Summary

This summary is machine-generated.

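This study introduces a novel semi-supervised approach to text-based person search (TBPS) that addresses the cost of data annotation. The proposed framework handles noisy generated data effectively and improves retrieval accuracy when annotations are limited.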

Scientific fields:

Computer Science
Artificial Intelligence
Machine Learning

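Background:

Text-based person search (TBPS) typically relies on large amounts of annotated image-text data for fully supervised learning.
Obtaining detailed text descriptions for large datasets of person images is difficult in practice and resource-intensive.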

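Purpose of the study:

To explore and develop effective TBPS methods in a semi-supervised setting, using only a limited amount of annotated data.
To address the performance limitations that existing TBPS methods show when annotated image-text pairs are scarce.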

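Main methods:

A two-stage generation-then-retrieval framework is proposed, which begins by using image captioning to generate pseudo-texts for unannotated images.
A noise-robust retrieval framework is introduced, comprising Hybrid Patch-Channel Masking (PC-Mask) and Noise-Guided Progressive Training (NP-Train).
PC-Mask refines the model architecture by masking the input data at both the patch level and the channel level, which mitigates overfitting to noisy pseudo-texts.
NP-Train strengthens training through a progressive schedule based on the noise level of the pseudo-texts, enabling robust learning.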
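
The two noise-robust components above can be illustrated with a minimal sketch. It is not the authors' implementation: the summary does not say how masked entries are filled, how noise is scored, or how the schedule is shaped. The sketch assumes that masked patches and channels are simply zeroed, that each pseudo-text already carries a scalar noise score, and that training proceeds in equal-sized stages from the cleanest samples to the noisiest. The names `pc_mask`, `progressive_subsets`, `patch_ratio`, `channel_ratio`, and `num_stages` are hypothetical.

```python
import math
import random


def pc_mask(patches, patch_ratio, channel_ratio, seed=0):
    """Zero a random subset of patches (rows) and channels (columns).

    `patches` is a list of patches, each a list of channel values.
    Zero-filling is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    num_patches = len(patches)
    num_channels = len(patches[0])
    drop_p = set(rng.sample(range(num_patches), int(num_patches * patch_ratio)))
    drop_c = set(rng.sample(range(num_channels), int(num_channels * channel_ratio)))
    return [
        [0.0 if (p in drop_p or c in drop_c) else value
         for c, value in enumerate(row)]
        for p, row in enumerate(patches)
    ]


def progressive_subsets(samples, noise_scores, num_stages):
    """Yield growing training subsets, least noisy pseudo-texts first.

    The equal-step schedule is an assumption; the noise scores are taken
    as given rather than estimated here.
    """
    order = sorted(range(len(samples)), key=lambda i: noise_scores[i])
    for stage in range(1, num_stages + 1):
        cutoff = math.ceil(len(order) * stage / num_stages)
        yield [samples[i] for i in order[:cutoff]]


if __name__ == "__main__":
    masked = pc_mask([[1.0] * 4 for _ in range(8)], 0.25, 0.5)
    for row in masked:
        print(row)
    for subset in progressive_subsets(["a", "b", "c", "d"], [0.9, 0.1, 0.5, 0.3], 2):
        print(subset)
```

In this toy run the second loop prints `['b', 'd']` and then `['b', 'd', 'c', 'a']`: the first stage trains only on the two pseudo-texts with the lowest noise scores, and the second stage adds the rest.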

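Main results:

The proposed semi-supervised TBPS framework shows promising performance on multiple benchmarks.
The noise-robust strategies (PC-Mask and NP-Train) significantly improve the model's ability to handle noisy pseudo-text data.
The generation-then-retrieval approach makes effective use of both the limited annotated data and the generated pseudo-texts to improve retrieval.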
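
As a rough illustration of how the generation stage could feed the retrieval stage, the sketch below merges annotated pairs with pseudo-text pairs and flags the latter so a retrieval model can treat them as potentially noisy. It only shows the data flow under stated assumptions: `build_training_pairs` and the `captioner` callable are hypothetical stand-ins, and no real captioning model or library API is implied.

```python
def build_training_pairs(labeled_pairs, unlabeled_images, captioner):
    """Combine annotated (image, text) pairs with generated pseudo-text pairs.

    Returns (image, text, is_pseudo) triples. `captioner` is any callable
    that maps an image to a caption string (an assumption of this sketch).
    """
    pairs = [(image, text, False) for image, text in labeled_pairs]
    pairs += [(image, captioner(image), True) for image in unlabeled_images]
    return pairs


if __name__ == "__main__":
    # Toy stand-in for an image captioning model.
    def toy_captioner(image):
        return "caption for " + image

    labeled = [("img1", "a man in a red coat")]
    for triple in build_training_pairs(labeled, ["img2", "img3"], toy_captioner):
        print(triple)
```

Keeping the `is_pseudo` flag on every pair is a design choice made for the sketch, so that later stages can apply noise-aware handling only to generated texts.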

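Conclusions:

Semi-supervised learning is a viable and effective approach to TBPS that significantly reduces reliance on fully annotated datasets.
The noise-robust retrieval framework offers a practical way to improve TBPS performance in low-annotation scenarios.
By addressing data scarcity, this study paves the way for more scalable and efficient TBPS systems.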