Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关实验视频

Updated: Jun 26, 2026

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

感知辅助变压器用于无监督对象重新识别.

Shuoyi Chen, Mang Ye, Xingping Dong

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
    |March 27, 2025
    PubMed
    概括
    此摘要是机器生成的。

    相关概念视频

    您也可能阅读

    相关文章

    通过共同作者、期刊和引用图与本文相关的文章。

    排序
    Same author

    LoRASculpt: Harmonious Low-Rank Adaptation for Multimodal Large Language Models.

    IEEE transactions on pattern analysis and machine intelligence·2026
    Same author

    Towards clinical-level interpretation of dental panoramic radiography using an instance-guided vision-language model.

    Nature biomedical engineering·2026
    Same author

    Systemic immune-inflammation index predicts post-thrombectomy outcomes and reveals a mediating role in the association between neurocardiac stress and prognosis: a multicenter study.

    Frontiers in neurology·2026
    Same author

    HiSymGeo: Hierarchical Context Symbiosis for Cross-View Object-Level Image Geo-Localization.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same author

    Holistic Invariant Retracing for Distortion-Resilient Multi-Modal Learning in Spatial Transcriptomics.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same author

    Differentiable Clustering Graph Convolutional Network for Hyperspectral Unmixing: Methodology and Benchmark.

    IEEE transactions on neural networks and learning systems·2026
    Same journal

    Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    GoP-based Quality Enhancement on Video Compression.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    查看所有相关文章

    本研究介绍了一个基于变压器的框架,用于无监督对象重新识别 (Re-ID),通过一种新的面具对齐策略来增强特征学习. 提出的方法实现了卓越的性能,超过了许多没有身份注释的监督方法.

    科学领域:

    • 计算机视觉 计算机视觉
    • 机器学习 机器学习
    • 人工智能的人工智能

    背景情况:

    • 无监督对象重新识别 (Re-ID) 传统上使用卷积神经网络 (CNN) 来进行特征提取和伪标签.
    • 在捕捉远程依赖和整合全球信息方面,CNN存在局限性,阻碍了复杂场景中的性能.
    • 视觉转换器 (ViT) 为各种数据结构提供了卓越的稳定性和建模能力,显示了Re-ID任务的前景.

    研究的目的:

    • 探索视觉转换器在无监督对象重新识别 (Re-ID) 中的潜力.
    • 提出一种新的基于变压器的框架 (PAT),以增强超越类别级监督的特征学习.
    • 改进无监督Re-ID.中的细粒度特征对齐和实例级歧视性学习.

    主要方法:

    • 提出了一个基于变压器的感知辅助框架 (PAT),用于无监督的Re-ID.
    • 引入了针对目标的面具对齐 (TMA) 策略,以利用低级视觉线索,并使用伪标签指导细粒度特征对齐.
    • 开发了一种感知融合特征增强 (PFA) 方法,以优化实例级别的歧视性学习.

    主要成果:

    • 与最先进的方法相比,PAT框架在多个Re-ID数据集上表现出卓越的性能和稳定性.
    • 拟议的TMA策略有效地纳入了本地像素信息,以改善歧视性特征的学习.

    更多相关视频

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
    04:23

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

    Published on: April 21, 2023

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments
    03:31

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

    Published on: December 15, 2023

    相关实验视频

    Last Updated: Jun 26, 2026

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
    04:23

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

    Published on: April 21, 2023

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments
    03:31

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

    Published on: December 15, 2023

  • 该方法取得的结果与许多监督的Re-ID方法相似或更好,尽管没有监督.
  • 结论:

    • 视觉转换器对于无监督的对象重新识别非常有效,特别是当与增强细粒度特征学习的策略相结合时.
    • 拟议的PAT框架,包括TMA和PFA,通过平衡歧视性学习和详细理解,为无监督的Re-ID提供了一个强大的方法.
    • 该方法在没有身份注释的情况下实现强性能的能力突出了其在实际应用中的潜力.