Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Transformer-Based Models for Predicting Molecular Structures from Infrared Spectra Using Patch-Based Self-Attention.

The journal of physical chemistry. A·2025
Same author

Structure and Intensity Unbiased Translation for 2D Medical Image Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2024
Same author

3D-printed porous zinc scaffold combined with bioactive serum exosomes promotes bone defect repair in rabbit radius.

Aging·2024
Same author

Disrupted Functional Brain Network Architecture in Sufferers with Boxing-Related Repeated Mild Traumatic Brain Injury: A Resting-State EEG Study.

Journal of integrative neuroscience·2024
Same author

[Research progress of diagnostic and therapeutic value of carbon dioxide-derived indicators in patients with sepsis].

Zhonghua wei zhong bing ji jiu yi xue·2024
Same author

Jingfang granules protects against intracerebral hemorrhage by inhibiting neuroinflammation and protecting blood-brain barrier damage.

Aging·2024
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Jul 1, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K

Weakly-Supervised RGBD Video Object Segmentation.

Jinyu Yang, Mingqi Gao, Feng Zheng

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |March 12, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces DepthVOS, a new benchmark for RGBD video object segmentation (VOS), and FusedCDNet, a weakly-supervised model that achieves strong performance. This advances RGBD VOS research by reducing annotation costs and improving robustness in complex scenes.

    More Related Videos

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.7K
    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    533

    Related Experiment Videos

    Last Updated: Jul 1, 2025

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.0K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.7K
    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    533

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Robotics

    Background:

    • RGBD video object segmentation (VOS) leverages depth data for enhanced accuracy and robustness.
    • The RGBD VOS task remains underexplored due to challenges in data collection and annotation.

    Purpose of the Study:

    • Introduce a new benchmark, DepthVOS, for RGBD VOS.
    • Propose FusedCDNet, a novel baseline model for weakly-supervised RGBD VOS.
    • Address the high cost of annotation and complexity of scenes in RGBD VOS.

    Main Methods:

    • Developed the DepthVOS benchmark with 350 videos (55k+ frames), annotated with masks and bounding boxes.
    • Proposed Fused Color-Depth Network (FusedCDNet) with a cross-modal fusion module.
    • Implemented a weakly-supervised training strategy using only bounding boxes and a bounding box guideline in the first frame for inference.

    Main Results:

    • FusedCDNet demonstrates performance on par with top fully-supervised algorithms.
    • The proposed weakly-supervised approach significantly reduces annotation burden.
    • The cross-modal fusion effectively handles complex scenes.

    Conclusions:

    • The DepthVOS benchmark and FusedCDNet provide a strong foundation for advancing RGBD VOS research.
    • Weakly-supervised methods are viable and effective for RGBD VOS.
    • Open-sourcing the project aims to foster further development in the field.