Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 1, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

1.3K

Soft Supervision Guided Spatial-Temporal Refinement Network For Video-based Visible-Infrared Person

Jinxing Li, Chuhao Zhou, Rundong Li

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |April 29, 2026
    PubMed
    Summary
    This summary is machine-generated.

    Related Concept Videos

    You might also read

    Related Articles

    Articles linked to this work by shared authors, journal, and citation graph.

    Sort by
    Same author

    Progressive Fusion of Multi-Scale Mamba Context and Local Detail Priors for Infrared Small Target Detection.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same author

    Performance of Age-Adjusted Whole Genome Sequencing Telomere Length in Idiopathic Pulmonary Fibrosis.

    American journal of respiratory and critical care medicine·2026
    Same author

    Publisher Correction: Whole genome sequence analysis of pulmonary function and COPD in 44,287 multi-ancestry participants.

    Genome biology·2026
    Same author

    Optical Coherence Tomography Biomarkers Differentiate Epiretinal Membranes Secondary to Retinal Detachment from Idiopathic Epiretinal Membranes.

    Journal of vitreoretinal diseases·2026
    Same author

    Arrhythmia Burden and Clinical Responses Under Continuous Monitoring in Heart Failure: Observations From the ALLEVIATE-HF Trial.

    Journal of the American College of Cardiology·2026
    Same author

    Risk-Based Nurse-Managed Personalized Heart Failure Interventions: The ALLEVIATE-HF Trial.

    Journal of the American College of Cardiology·2026
    Same journal

    CLASH-CTTA: Class-Wise Shift-Aware Hierarchical Continual Test-Time Adaptation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Voxel-based Point Cloud Geometry Compression with Space-to-Channel Context.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    RIGI: Rectifying Image-to-3D Generation Inconsistency via Uncertainty-Aware Learning.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    DA-Cal: Towards Cross-Domain Calibration in Semantic Segmentation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Multi-Dimensional Quality Assessment for Single-Image-to-3D Contents: Dataset and Model.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Enhancing Underwater Light Field Images via Global Geometry-Aware Diffusion Process.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    See all related articles

    This study introduces a new method for video-based cross-modal person re-identification (Re-ID) using the HITSZ-PVCM dataset. The Soft Supervision guided Spatial-Temporal Refinement (S3TR) network improves pedestrian recognition across different camera modes.

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Person re-identification (Re-ID) enables tracking individuals across cameras using visible and infrared modes.
    • Video-based Re-ID offers richer appearance details than still images but faces challenges in capturing fine-grained information.
    • Existing methods often lose details and struggle with intra-class variations, limiting model generalization.

    Purpose of the Study:

    • To develop a novel network for video-based cross-modal person Re-ID.
    • To address the loss of fine-grained details in temporal representations.
    • To improve model generalization by overcoming limitations of traditional metric losses.

    Main Methods:

    • Introduction of the Soft Supervision guided Spatial-Temporal Refinement (S3TR) network.

    More Related Videos

    Utilizing vmTracking to Improve the Accuracy of Multi-Animal Pose Estimation in Rodent Social Behavior Studies
    07:34

    Utilizing vmTracking to Improve the Accuracy of Multi-Animal Pose Estimation in Rodent Social Behavior Studies

    Published on: November 7, 2025

    548

    Related Experiment Videos

    Last Updated: May 1, 2026

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    1.3K
    Utilizing vmTracking to Improve the Accuracy of Multi-Animal Pose Estimation in Rodent Social Behavior Studies
    07:34

    Utilizing vmTracking to Improve the Accuracy of Multi-Animal Pose Estimation in Rodent Social Behavior Studies

    Published on: November 7, 2025

    548
  • Frame refinement guided by coarse temporal features for discriminative feature extraction.
  • Global-local mutual learning to bridge the modality gap and a novel soft-clustering center loss for improved generalization.
  • Main Results:

    • The proposed S3TR network effectively refines features and captures fine-grained details.
    • The global-local mutual learning module successfully reduces the modality gap.
    • The soft-clustering center loss enhances model generalization by considering group-wise similarities.

    Conclusions:

    • S3TR achieves superior performance in video-based cross-modal person Re-ID.
    • The HITSZ-PVCM dataset is the largest to date for this task.
    • The proposed methods offer a significant advancement in person re-identification accuracy and generalization.