Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Orientational order in dense suspensions of elliptical particles in the non-Stokesian regime.

Soft matter·2020

Same author

Shear-driven segregation of dry granular materials with different friction coefficients.

Soft matter·2016

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 3, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Cluster2Former: Semisupervised Clustering Transformers for Video Instance Segmentation.

Áron Fóthi¹, Adrián Szlatincsán¹, Ellák Somfai^1,2

¹Department of Artificial Intelligence, ELTE Eötvös Loránd University, 1053 Budapest, Hungary.

Sensors (Basel, Switzerland)

|February 10, 2024

Summary

This summary is machine-generated.

This study introduces Cluster2Former, a semisupervised learning method for video instance segmentation. It uses minimal scribble annotations to achieve competitive results, reducing data labeling costs.

Keywords:

instance segmentation semisupervised learning transformers video processing

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Related Experiment Videos

Last Updated: Jul 3, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Video instance segmentation is crucial for understanding dynamic scenes.
Traditional methods require extensive pixel-level annotations, which are costly and time-consuming.
Semisupervised learning offers a potential solution to reduce annotation burden.

Purpose of the Study:

To develop a novel semisupervised approach for video instance segmentation.
To reduce the reliance on fully annotated datasets by utilizing lightweight annotations.
To improve the cost-effectiveness and efficiency of video instance segmentation.

Main Methods:

Proposed the Cluster2Former model, augmenting existing architectures like Mask2Former.
Employed scribble-based annotations for training.
Introduced a similarity-based constraint loss to effectively handle partial annotations.

Main Results:

Achieved competitive performance on standard video instance segmentation benchmarks.
Demonstrated effectiveness with as little as 0.5% of annotated pixels.
Showcased the model's ability to handle limited annotation resources.

Conclusions:

Cluster2Former provides a viable and efficient solution for video instance segmentation.
The approach significantly lowers annotation costs and computational requirements.
It is particularly beneficial for applications with scarce labeled data.