Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Stereotype Content Model02:16

Stereotype Content Model

13.1K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
13.1K
Self-Evaluation Maintenance Model01:29

Self-Evaluation Maintenance Model

427
The Self-Evaluation Maintenance (SEM) model offers a psychological framework to understand how individuals’ self-esteem is influenced by the achievements of others, particularly those with whom they share close personal bonds. The SEM model operates when personal rather than social identity guides individuals. Central to this model is the notion that individuals have an inherent desire to preserve a favorable self-image, which is continuously shaped by interpersonal comparisons and...
427

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Effectiveness of Family Physician-Led Home-Based Long-Term Care on Quality of Life in Older Adults With Disability: A Quasi-Experimental Study.

Journal of the American Medical Directors Association·2026
Same author

Cubosomes with pH-triggered cubic phase transition enable cytosolic mRNA delivery for acute respiratory distress syndrome therapy.

Journal of controlled release : official journal of the Controlled Release Society·2026
Same author

STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction.

medRxiv : the preprint server for health sciences·2026
Same author

Biomass-Derived Hydrogels for Load-Bearing Connective Tissue Repair: Integrative Reinforcement, Bio-Functional Design, and Emerging Pathways Toward Clinical Translation.

Advanced healthcare materials·2026
Same author

A PEI-Based Surface Strategy for Robust <b><sup>177</sup></b>Lu Incorporation into Biodegradable Chitosan Microspheres: A Potential Platform for Hepatocellular Carcinoma Radioembolization.

Biomacromolecules·2026
Same author

Monolithic Integration of Carbon Nanotube-Based Complementary Field-Effect Transistors with 3D-Stacked Photodiodes for Unified Sensing and Computing.

ACS nano·2026
Same journal

TraNce: Type-aware hypergraph neural network with biological mediators for drug repositioning.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Decentralized ADMM for factorization-based Low-rank matrix estimation.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Memristive neuromorphic circuit design inspired by the neural mechanisms of conditioned fear.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: May 5, 2026

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data
12:08

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

Published on: August 13, 2014

24.7K

RefSAM: Efficiently adapting segmenting anything model for referring video object segmentation.

Yonglin Li1, Jing Zhang2, Xiao Teng1

  • 1College of Computer Science and Technology, National University of Defense Technology, Changsha, 410073, China.

Neural Networks : the Official Journal of the International Neural Network Society
|August 25, 2025
PubMed
Summary
This summary is machine-generated.

The RefSAM model enhances the Segment Anything Model (SAM) for referring video object segmentation (RVOS) by integrating multi-view and multi-modal information. This approach improves segmentation accuracy by effectively fusing language and visual features.

Keywords:
Multimodal learningObject segmentationSegment anythingVision transformer

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.9K

Related Experiment Videos

Last Updated: May 5, 2026

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data
12:08

From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data

Published on: August 13, 2014

24.7K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.9K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • The Segment Anything Model (SAM) demonstrates strong image segmentation capabilities but struggles with referring video object segmentation (RVOS).
  • Existing RVOS methods often require precise user prompts and lack robust multi-modal understanding (language and vision).

Purpose of the Study:

  • To adapt SAM for effective referring video object segmentation (RVOS).
  • To enhance cross-modality learning by integrating diverse visual and linguistic information from successive video frames.

Main Methods:

  • Introduced the RefSAM model, adapting SAM with a Cross-Modal MLP for text-to-embedding projection.
  • Developed a hierarchical dense attention module for fusing visual-semantic information and sparse embeddings.
  • Incorporated an implicit tracking module for historical context and a parameter-efficient tuning strategy for feature alignment.

Main Results:

  • RefSAM effectively incorporates multi-view information from diverse modalities and successive frames.
  • The model demonstrates superior performance on RVOS tasks compared to existing methods.
  • Ablation studies confirm the efficacy of the proposed design choices.

Conclusions:

  • RefSAM significantly advances the application of SAM for referring video object segmentation.
  • The model's ability to fuse multi-modal information and leverage temporal context offers a robust solution for RVOS.