Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

Search research articles

相关实验视频

Updated: Jun 26, 2026

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

感知辅助变压器用于无监督对象重新识别.

Shuoyi Chen, Mang Ye, Xingping Dong

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

|March 27, 2025

概括

此摘要是机器生成的。

相关概念视频

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

LoRASculpt: Harmonious Low-Rank Adaptation for Multimodal Large Language Models.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

Towards clinical-level interpretation of dental panoramic radiography using an instance-guided vision-language model.

Nature biomedical engineering·2026

Same author

Systemic immune-inflammation index predicts post-thrombectomy outcomes and reveals a mediating role in the association between neurocardiac stress and prognosis: a multicenter study.

Frontiers in neurology·2026

Same author

HiSymGeo: Hierarchical Context Symbiosis for Cross-View Object-Level Image Geo-Localization.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same author

Holistic Invariant Retracing for Distortion-Resilient Multi-Modal Learning in Spatial Transcriptomics.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same author

Differentiable Clustering Graph Convolutional Network for Hyperspectral Unmixing: Methodology and Benchmark.

IEEE transactions on neural networks and learning systems·2026

Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

查看所有相关文章

本研究介绍了一个基于变压器的框架,用于无监督对象重新识别 (Re-ID),通过一种新的面具对齐策略来增强特征学习. 提出的方法实现了卓越的性能,超过了许多没有身份注释的监督方法.

科学领域:

计算机视觉计算机视觉
机器学习机器学习
人工智能的人工智能

背景情况:

无监督对象重新识别 (Re-ID) 传统上使用卷积神经网络 (CNN) 来进行特征提取和伪标签.
在捕捉远程依赖和整合全球信息方面,CNN存在局限性,阻碍了复杂场景中的性能.
视觉转换器 (ViT) 为各种数据结构提供了卓越的稳定性和建模能力,显示了Re-ID任务的前景.

研究的目的:

探索视觉转换器在无监督对象重新识别 (Re-ID) 中的潜力.
提出一种新的基于变压器的框架 (PAT),以增强超越类别级监督的特征学习.
改进无监督Re-ID.中的细粒度特征对齐和实例级歧视性学习.

主要方法:

提出了一个基于变压器的感知辅助框架 (PAT),用于无监督的Re-ID.
引入了针对目标的面具对齐 (TMA) 策略,以利用低级视觉线索,并使用伪标签指导细粒度特征对齐.
开发了一种感知融合特征增强 (PFA) 方法,以优化实例级别的歧视性学习.

主要成果:

与最先进的方法相比,PAT框架在多个Re-ID数据集上表现出卓越的性能和稳定性.
拟议的TMA策略有效地纳入了本地像素信息,以改善歧视性特征的学习.

更多相关视频

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

Published on: December 15, 2023

相关实验视频

Last Updated: Jun 26, 2026

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

Published on: December 15, 2023

该方法取得的结果与许多监督的Re-ID方法相似或更好,尽管没有监督.

结论:

视觉转换器对于无监督的对象重新识别非常有效,特别是当与增强细粒度特征学习的策略相结合时.
拟议的PAT框架,包括TMA和PFA,通过平衡歧视性学习和详细理解,为无监督的Re-ID提供了一个强大的方法.
该方法在没有身份注释的情况下实现强性能的能力突出了其在实际应用中的潜力.