Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Dec 24, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.5K

Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments.

Kai Niu, Yan Huang, Wanli Ouyang

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |April 11, 2020
    PubMed
    Summary
    This summary is machine-generated.

    Related Concept Videos

    You might also read

    Related Articles

    Articles linked to this work by shared authors, journal, and citation graph.

    Sort by
    Same author

    Semantic Channel Capacity of Rayleigh Fading Channels Based on Synonymous Mapping.

    Entropy (Basel, Switzerland)·2026
    Same author

    Semantic Algorithmic Information Theory: From Kolmogorov Complexity to Semantic Equivalence.

    Entropy (Basel, Switzerland)·2026
    Same author

    Pseudo Sentences Evaluation and Quality-Aware Robust Learning for Unsupervised Text-Based Person Search.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same author

    BA-UNet: A Boundary Augmented Segmentation Network for Cervical Cancer Radiotherapy.

    Journal of imaging informatics in medicine·2026
    Same author

    Fully automated, deep learning, cardiac CT-based multimodal network for cardiovascular risk stratification in high-risk perioperative patients.

    European heart journal. Digital health·2026
    Same author

    A deep learning-based method for evaluating the fitting states of orthokeratology lenses using fluorescein staining videos.

    Contact lens & anterior eye : the journal of the British Contact Lens Association·2026
    Same journal

    Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    GoP-based Quality Enhancement on Video Compression.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    See all related articles

    This study introduces a novel Multi-granularity Image-text Alignments (MIA) model for description-based person re-identification (Re-id). The MIA model effectively addresses cross-modal fine-grained challenges, significantly improving similarity evaluation in video surveillance.

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Description-based person re-identification (Re-id) is crucial for video surveillance.
    • Challenges include modality heterogeneity (cross-modal problem) and fine-grained matching within categories.
    • Existing methods struggle with direct image-description similarity due to these complexities.

    Purpose of the Study:

    • To propose a novel Multi-granularity Image-text Alignments (MIA) model.
    • To alleviate the cross-modal fine-grained problem in description-based person Re-id.
    • To enhance similarity evaluation between images and textual descriptions.

    Main Methods:

    • Hierarchical alignment across three granularities: global-global, global-local, and local-local.

    More Related Videos

    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns
    13:44

    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

    Published on: August 30, 2013

    43.5K

    Related Experiment Videos

    Last Updated: Dec 24, 2025

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.5K
    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns
    13:44

    Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

    Published on: August 30, 2013

    43.5K
  • Global Contrast (GC) module for global context matching.
  • Relation-guided Global-local Alignment (RGA) module for adaptive component highlighting.
  • Bi-directional Fine-grained Matching (BFM) module for part-phrase matching.
  • End-to-end training with a proposed step training strategy.
  • Main Results:

    • Achieved state-of-the-art performance on the CUHK-PEDES dataset.
    • Outperformed previous methods by a significant margin.
    • Demonstrated the effectiveness of multi-granularity alignments for Re-id.

    Conclusions:

    • The MIA model effectively addresses the cross-modal fine-grained problem in person Re-id.
    • Hierarchical alignments at multiple granularities improve similarity evaluation.
    • The proposed method offers a robust and efficient solution for description-based person Re-id tasks.