Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Holistic Invariant Retracing for Distortion-Resilient Multi-Modal Learning in Spatial Transcriptomics.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Demonstration of efficient predictive surrogates for large-scale quantum processors.

Nature communications·2026
Same author

A DeepSeek-powered AI system for automated chest radiograph interpretation in clinical practice.

Nature communications·2026
Same author

NoisePO: Efficient Semantic Noise Generation and Ranking for Diffusion-Based Text-to-Image Synthesis.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Stability and Generalization for Distributed SGDA.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Aug 26, 2025

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.9K

Improving Video Instance Segmentation via Temporal Pyramid Routing.

Xiangtai Li, Hao He, Yibo Yang

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |October 4, 2022
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces Temporal Pyramid Routing (TPR), a novel strategy for video instance segmentation (VIS). TPR effectively integrates temporal and multi-scale features, significantly improving detection, segmentation, and tracking of instances in videos.

    More Related Videos

    A Comprehensive Protocol for Manual Segmentation of the Medial Temporal Lobe Structures
    12:30

    A Comprehensive Protocol for Manual Segmentation of the Medial Temporal Lobe Structures

    Published on: July 2, 2014

    20.4K
    Profiling Maternal Behavior Responses During Whole-Brain Imaging
    07:12

    Profiling Maternal Behavior Responses During Whole-Brain Imaging

    Published on: January 24, 2025

    1.0K

    Related Experiment Videos

    Last Updated: Aug 26, 2025

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.9K
    A Comprehensive Protocol for Manual Segmentation of the Medial Temporal Lobe Structures
    12:30

    A Comprehensive Protocol for Manual Segmentation of the Medial Temporal Lobe Structures

    Published on: July 2, 2014

    20.4K
    Profiling Maternal Behavior Responses During Whole-Brain Imaging
    07:12

    Profiling Maternal Behavior Responses During Whole-Brain Imaging

    Published on: January 24, 2025

    1.0K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Video Instance Segmentation (VIS) is a complex multi-task problem involving detection, segmentation, and tracking of instances within video sequences.
    • Current VIS methods often neglect either temporal dynamics or multi-scale feature information, limiting their performance.
    • Existing approaches rely on single-frame or single-scale features, failing to capture comprehensive spatio-temporal context.

    Purpose of the Study:

    • To propose a novel Temporal Pyramid Routing (TPR) strategy for enhancing Video Instance Segmentation (VIS).
    • To effectively incorporate both temporal and multi-scale feature information into VIS models.
    • To develop a lightweight and adaptable module for improving existing instance segmentation frameworks.

    Main Methods:

    • The proposed Temporal Pyramid Routing (TPR) strategy conditionally aligns and aggregates pixel-level features from adjacent frames.
    • TPR utilizes two key components: Dynamic Aligned Cell Routing (DACR) for temporal alignment and gating, and Cross Pyramid Routing (CPR) for cross-scale feature transfer.
    • The method is designed as a plug-and-play module, easily integrated into existing instance segmentation architectures.

    Main Results:

    • Experiments on YouTube-VIS (2019, 2021) and Cityscapes-VPS datasets demonstrate the effectiveness of TPR.
    • The proposed approach significantly enhances the performance of state-of-the-art instance and panoptic segmentation methods.
    • TPR proves to be an efficient and effective module for improving video instance segmentation.

    Conclusions:

    • Temporal Pyramid Routing (TPR) successfully addresses the limitations of existing VIS methods by integrating temporal and multi-scale information.
    • The proposed method offers a lightweight and versatile solution for advancing video instance segmentation.
    • TPR shows strong potential for practical applications requiring accurate and efficient instance tracking in videos.