Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Exploring the Stochastic Regularisation in Normalisation Layers for Semi-Supervised Learning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Embodied Spatial Affordance: Spatial-Aware Affordance Learning for Embodied Navigation and Manipulation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Paving the Way for Point Cloud Video Representation Learning Using a PDE Model.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

A cross-sectional study of associations between constitutional morphological features and individual radiographic phenotypes of hip osteoarthritis.

Osteoarthritis and cartilage·2026
Same author

ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Improving anomaly detection with foundation-model synthesis and wavelet-domain attention.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Style-Aware Contrastive Test-Time Adaptation: A Dual-Cache Model for Robust Vision-Language Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Semantic Frame Interpolation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Physics-Guided Cross-Modal Decoupling with Test-Time Adaptation for Hyperspectral Image Restoration.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Jun 20, 2025

Multi-color Localization Microscopy of Single Membrane Proteins in Organelles of Live Mammalian Cells
11:06

Multi-color Localization Microscopy of Single Membrane Proteins in Organelles of Live Mammalian Cells

Published on: June 30, 2018

8.5K

Exploring Multi-Modal Spatial-Temporal Contexts for High-Performance RGB-T Tracking.

Tianlu Zhang, Qiang Jiao, Qiang Zhang

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |July 19, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new Multi-modal Spatial-Temporal Context (MMSTC) network for RGB-T tracking. The MMSTC network effectively utilizes multi-modal spatial relationships and temporal consistencies for improved object tracking performance.

    More Related Videos

    High-resolution Spatiotemporal Analysis of Receptor Dynamics by Single-molecule Fluorescence Microscopy
    15:13

    High-resolution Spatiotemporal Analysis of Receptor Dynamics by Single-molecule Fluorescence Microscopy

    Published on: July 25, 2014

    11.4K
    Automated Two-dimensional Spatiotemporal Analysis of Mobile Single-molecule FRET Probes
    08:26

    Automated Two-dimensional Spatiotemporal Analysis of Mobile Single-molecule FRET Probes

    Published on: November 23, 2021

    2.5K

    Related Experiment Videos

    Last Updated: Jun 20, 2025

    Multi-color Localization Microscopy of Single Membrane Proteins in Organelles of Live Mammalian Cells
    11:06

    Multi-color Localization Microscopy of Single Membrane Proteins in Organelles of Live Mammalian Cells

    Published on: June 30, 2018

    8.5K
    High-resolution Spatiotemporal Analysis of Receptor Dynamics by Single-molecule Fluorescence Microscopy
    15:13

    High-resolution Spatiotemporal Analysis of Receptor Dynamics by Single-molecule Fluorescence Microscopy

    Published on: July 25, 2014

    11.4K
    Automated Two-dimensional Spatiotemporal Analysis of Mobile Single-molecule FRET Probes
    08:26

    Automated Two-dimensional Spatiotemporal Analysis of Mobile Single-molecule FRET Probes

    Published on: November 23, 2021

    2.5K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence

    Background:

    • RGB-T tracking relies on spatial relationships within multi-modal data and across frames.
    • Existing trackers often neglect these crucial multi-modal spatial and temporal aspects.
    • This leads to limitations in robust tracking, especially in complex scenarios.

    Purpose of the Study:

    • To propose a novel Multi-modal Spatial-Temporal Context (MMSTC) network for enhancing RGB-T tracking.
    • To address the limitations of existing trackers in utilizing multi-modal spatial relationships and temporal consistencies.
    • To improve the robustness and applicability of RGB-T tracking in challenging environments.

    Main Methods:

    • Developed a Transformer-based network (MMSTC) for RGB-T tracking.
    • Introduced a Multi-modal Transformer Encoder (MMTE) for encoding spatial contexts and fusing multi-modal features.
    • Proposed a Quality-aware Transformer Decoder (QATD) for propagating temporal tracking cues.

    Main Results:

    • The MMSTC network demonstrated superior performance on five prevalent RGB-T tracking benchmarks.
    • Achieved new state-of-the-art results compared to existing RGB-T tracking methods.
    • The proposed network architecture is versatile and can be integrated into various tracking frameworks.

    Conclusions:

    • The MMSTC network effectively leverages multi-modal spatial context and temporal information for robust RGB-T tracking.
    • The proposed approach significantly advances the state-of-the-art in RGB-T tracking.
    • The MMSTC network offers a promising direction for future research and practical applications in RGB-T tracking.