Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Transformer-Based Models for Predicting Molecular Structures from Infrared Spectra Using Patch-Based Self-Attention.

The journal of physical chemistry. A·2025

Same author

Structure and Intensity Unbiased Translation for 2D Medical Image Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2024

Same author

3D-printed porous zinc scaffold combined with bioactive serum exosomes promotes bone defect repair in rabbit radius.

Aging·2024

Same author

Disrupted Functional Brain Network Architecture in Sufferers with Boxing-Related Repeated Mild Traumatic Brain Injury: A Resting-State EEG Study.

Journal of integrative neuroscience·2024

Same author

[Research progress of diagnostic and therapeutic value of carbon dioxide-derived indicators in patients with sepsis].

Zhonghua wei zhong bing ji jiu yi xue·2024

Same author

Jingfang granules protects against intracerebral hemorrhage by inhibiting neuroinflammation and protecting blood-brain barrier damage.

Aging·2024

Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 1, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Weakly-Supervised RGBD Video Object Segmentation.

Jinyu Yang, Mingqi Gao, Feng Zheng

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

|March 12, 2024

Summary

This summary is machine-generated.

This study introduces DepthVOS, a new benchmark for RGBD video object segmentation (VOS), and FusedCDNet, a weakly-supervised model that achieves strong performance. This advances RGBD VOS research by reducing annotation costs and improving robustness in complex scenes.

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Related Experiment Videos

Last Updated: Jul 1, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Area of Science:

Computer Vision
Machine Learning
Robotics

Background:

RGBD video object segmentation (VOS) leverages depth data for enhanced accuracy and robustness.
The RGBD VOS task remains underexplored due to challenges in data collection and annotation.

Purpose of the Study:

Introduce a new benchmark, DepthVOS, for RGBD VOS.
Propose FusedCDNet, a novel baseline model for weakly-supervised RGBD VOS.
Address the high cost of annotation and complexity of scenes in RGBD VOS.

Main Methods:

Developed the DepthVOS benchmark with 350 videos (55k+ frames), annotated with masks and bounding boxes.
Proposed Fused Color-Depth Network (FusedCDNet) with a cross-modal fusion module.
Implemented a weakly-supervised training strategy using only bounding boxes and a bounding box guideline in the first frame for inference.

Main Results:

FusedCDNet demonstrates performance on par with top fully-supervised algorithms.
The proposed weakly-supervised approach significantly reduces annotation burden.
The cross-modal fusion effectively handles complex scenes.

Conclusions:

The DepthVOS benchmark and FusedCDNet provide a strong foundation for advancing RGBD VOS research.
Weakly-supervised methods are viable and effective for RGBD VOS.
Open-sourcing the project aims to foster further development in the field.