Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Stereotype Content Model

Stereotype Content Model

The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...

Self-Evaluation Maintenance Model

Self-Evaluation Maintenance Model

The Self-Evaluation Maintenance (SEM) model offers a psychological framework to understand how individuals’ self-esteem is influenced by the achievements of others, particularly those with whom they share close personal bonds. The SEM model operates when personal rather than social identity guides individuals. Central to this model is the notion that individuals have an inherent desire to preserve a favorable self-image, which is continuously shaped by interpersonal comparisons and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Effectiveness of Family Physician-Led Home-Based Long-Term Care on Quality of Life in Older Adults With Disability: A Quasi-Experimental Study.

Journal of the American Medical Directors Association·2026

Same author

Cubosomes with pH-triggered cubic phase transition enable cytosolic mRNA delivery for acute respiratory distress syndrome therapy.

Journal of controlled release : official journal of the Controlled Release Society·2026

Same author

STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction.

medRxiv : the preprint server for health sciences·2026

Same author

Biomass-Derived Hydrogels for Load-Bearing Connective Tissue Repair: Integrative Reinforcement, Bio-Functional Design, and Emerging Pathways Toward Clinical Translation.

Advanced healthcare materials·2026

Same author

A PEI-Based Surface Strategy for Robust <b><sup>177</sup></b>Lu Incorporation into Biodegradable Chitosan Microspheres: A Potential Platform for Hepatocellular Carcinoma Radioembolization.

Biomacromolecules·2026

Same author

Monolithic Integration of Carbon Nanotube-Based Complementary Field-Effect Transistors with 3D-Stacked Photodiodes for Unified Sensing and Computing.

ACS nano·2026

Same journal

TraNce: Type-aware hypergraph neural network with biological mediators for drug repositioning.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Decentralized ADMM for factorization-based Low-rank matrix estimation.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Memristive neuromorphic circuit design inspired by the neural mechanisms of conditioned fear.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 5, 2026

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

Published on: August 13, 2014

RefSAM: Efficiently adapting segmenting anything model for referring video object segmentation.

Yonglin Li¹, Jing Zhang², Xiao Teng¹

¹College of Computer Science and Technology, National University of Defense Technology, Changsha, 410073, China.

Neural Networks : the Official Journal of the International Neural Network Society

|August 25, 2025

Summary

This summary is machine-generated.

The RefSAM model enhances the Segment Anything Model (SAM) for referring video object segmentation (RVOS) by integrating multi-view and multi-modal information. This approach improves segmentation accuracy by effectively fusing language and visual features.

Keywords:

Multimodal learning Object segmentation Segment anything Vision transformer

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Related Experiment Videos

Last Updated: May 5, 2026

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

Published on: August 13, 2014

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

The Segment Anything Model (SAM) demonstrates strong image segmentation capabilities but struggles with referring video object segmentation (RVOS).
Existing RVOS methods often require precise user prompts and lack robust multi-modal understanding (language and vision).

Purpose of the Study:

To adapt SAM for effective referring video object segmentation (RVOS).
To enhance cross-modality learning by integrating diverse visual and linguistic information from successive video frames.

Main Methods:

Introduced the RefSAM model, adapting SAM with a Cross-Modal MLP for text-to-embedding projection.
Developed a hierarchical dense attention module for fusing visual-semantic information and sparse embeddings.
Incorporated an implicit tracking module for historical context and a parameter-efficient tuning strategy for feature alignment.

Main Results:

RefSAM effectively incorporates multi-view information from diverse modalities and successive frames.
The model demonstrates superior performance on RVOS tasks compared to existing methods.
Ablation studies confirm the efficacy of the proposed design choices.

Conclusions:

RefSAM significantly advances the application of SAM for referring video object segmentation.
The model's ability to fuse multi-modal information and leverage temporal context offers a robust solution for RVOS.