Updated: Jan 17, 2026

Semi-Supervised Text-Based Person Search

Daming Gao, Yang Bai, Min Cao

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
    |September 16, 2025
    PubMed
    Summary
    This summary is machine-generated.


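    This study introduces a novel semi-supervised approach to text-based person search (TBPS) that addresses the challenge of data annotation. The proposed framework handles noisy generated data effectively and improves retrieval accuracy when annotations are limited.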

    Scientific fields:

    • Computer science
    • Artificial intelligence
    • Machine learning

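    Background:

    • Text-based person search (TBPS) typically requires extensive annotated image-text data for fully supervised learning.
    • Obtaining detailed text descriptions for large datasets of person images is challenging and resource-intensive in practice.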

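    Study objectives:

    • Explore and develop an effective TBPS method for the semi-supervised setting, using limited annotated data.
    • Address the performance limitations of existing TBPS methods when annotated image-text pairs are scarce.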

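    Main methods:

    • Proposes a two-stage generate-then-retrieve framework that first uses image captioning to generate pseudo-texts for unannotated images.
    • Introduces a noise-robust retrieval framework comprising Hybrid Patch-Channel Masking (PC-Mask) and Noise-Guided Progressive Training (NP-Train).
    • PC-Mask refines the model architecture by masking the input data at both the patch level and the channel level, which reduces overfitting to noisy pseudo-texts.
    • NP-Train improves training through a progressive schedule based on the noise level of the pseudo-texts, which supports robust learning.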
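The idea of masking at both the patch level and the channel level can be illustrated with a small sketch. This is not the authors' implementation: the summary does not specify how PC-Mask selects or replaces values, so the function name `pc_mask`, the two ratio parameters, the uniform random selection, and the zero fill are all assumptions made for illustration. The input is treated as a list of patch feature vectors.

```python
import random

def pc_mask(tokens, patch_ratio, channel_ratio, rng):
    """Zero out a random subset of patches (rows) and channels (columns).

    Hypothetical illustration only: the selection rule and the zero fill
    are assumptions, not details taken from the paper.
    """
    num_patches = len(tokens)
    num_channels = len(tokens[0])
    # Pick which whole patches and which whole channels to hide.
    masked_patches = set(rng.sample(range(num_patches), int(num_patches * patch_ratio)))
    masked_channels = set(rng.sample(range(num_channels), int(num_channels * channel_ratio)))
    # Build a new matrix; the input list is left unchanged.
    return [
        [0.0 if (p in masked_patches or c in masked_channels) else value
         for c, value in enumerate(row)]
        for p, row in enumerate(tokens)
    ]

rng = random.Random(0)                      # fixed seed for repeatability
tokens = [[1.0] * 8 for _ in range(4)]      # 4 patches, 8 channels each
masked = pc_mask(tokens, patch_ratio=0.5, channel_ratio=0.25, rng=rng)
for row in masked:
    print(row)
```

With 4 patches and 8 channels, two full rows and two full columns are zeroed, so the model would see only part of each input during training.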

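    Main results:

    • The proposed semi-supervised TBPS framework shows promising performance on multiple benchmark metrics.
    • The noise-robust strategies (PC-Mask and NP-Train) significantly improve the model's ability to handle noisy pseudo-text data.
    • The generate-then-retrieve approach makes effective use of limited annotated data and generated pseudo-texts to improve retrieval.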
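A noise-guided progressive schedule of the kind NP-Train is described as using could be sketched as a simple curriculum. The summary does not give the actual schedule, so everything here is an assumption: the helper name `progressive_subset`, the per-sample noise scores, the cleanest-first ordering, and the linear growth from `start_fraction` to the full dataset.

```python
def progressive_subset(noise_scores, epoch, total_epochs, start_fraction=0.5):
    """Return indices of the samples admitted at `epoch`, cleanest first.

    Hypothetical illustration only: the linear schedule and the
    cleanest-first ordering are assumptions, not details from the paper.
    """
    # Progress runs from 0.0 at the first epoch to 1.0 at the last one.
    progress = min(epoch / max(total_epochs - 1, 1), 1.0)
    fraction = start_fraction + (1.0 - start_fraction) * progress
    keep = max(1, round(fraction * len(noise_scores)))
    # Sort sample indices by estimated noise, lowest noise first.
    order = sorted(range(len(noise_scores)), key=lambda i: noise_scores[i])
    return order[:keep]

noise = [0.9, 0.1, 0.5, 0.3]   # assumed noise estimate per pseudo-text
schedule = [progressive_subset(noise, e, total_epochs=3) for e in range(3)]
for epoch, indices in enumerate(schedule):
    print(epoch, indices)
```

In this sketch the training set grows from the two cleanest pseudo-texts to all four over three epochs, so the noisiest samples only enter once the model has seen the more reliable ones.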


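    Conclusions:

    • Semi-supervised learning is a feasible and effective approach to TBPS that significantly reduces the dependence on fully annotated datasets.
    • The noise-robust retrieval framework offers a practical way to improve TBPS performance in low-annotation scenarios.
    • By addressing the challenge of data scarcity, this study paves the way for more scalable and efficient TBPS systems.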