Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Jul 1, 2026

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

A Benchmark Dataset and Saliency-Guided Stacked Autoencoders for Video-Based Salient Object Detection.

Jia Li, Changqun Xia, Xiaowu Chen

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |October 14, 2017
    PubMed
    Summary
    This summary is machine-generated.

    Related Concept Videos

    You might also read

    Related Articles

    Articles linked to this work by shared authors, journal, and citation graph.

    Sort by
    Same author

    Learning When and How to Update Memory for Video Object Segmentation.

    IEEE transactions on pattern analysis and machine intelligence·2026
    Same author

    Rethinking Lightweight Salient Object Detection via Network Depth-Width Tradeoff.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2023
    Same author

    Boosting Broader Receptive Fields for Salient Object Detection.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2023
    Same author

    Salient Object Detection With Purificatory Mechanism and Structural Similarity Loss.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2021
    Same author

    Semantic Object Segmentation in Tagged Videos via Detection.

    IEEE transactions on pattern analysis and machine intelligence·2017
    Same author

    Transparent metals for ultrabroadband electromagnetic waves.

    Advanced materials (Deerfield Beach, Fla.)·2012
    Same journal

    Hyperbolic Cycle Alignment for Infrared-Visible Image Fusion.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Learning Gaze Synthesizer via 3D-eye Controlled Diffusion and Cross-domain Feature Alignment.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Underlying Semantic Diffusion for Effective and Efficient In-Context Learning.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    DiffRES: Unleashing Text-to-Image Diffusion Models for Generative Referring Expression Segmentation without Information Leakage.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    Location Matters: Frequency-Spatial Dual Space Adaptation for Cross-Domain Few-Shot Segmentation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    Same journal

    BayeTopo: Bayesian-based Topology-guided Learning for Vascular Imaging Segmentation.

    IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
    See all related articles

    Researchers developed a new dataset and unsupervised method for video salient object detection (SOD). The approach effectively identifies consistently salient objects across video frames, outperforming most existing models.

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Human-Computer Interaction

    Background:

    • Image-based salient object detection (SOD) is well-researched, but video-based SOD lacks large-scale, well-annotated datasets.
    • Existing video SOD research is limited by the absence of unambiguous definitions and annotations for salient objects in dynamic scenes.

    Purpose of the Study:

    • To introduce the largest dataset to date for video-based salient object detection (SOD).
    • To propose an unsupervised baseline method for video SOD using saliency-guided stacked autoencoders.
    • To establish a challenging benchmark for advancing video SOD research.

    Main Methods:

    • A new dataset comprising 200 videos with manually annotated objects/regions on 7650 keyframes and eye-tracking data from 23 subjects was created.

    More Related Videos

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments
    03:31

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

    Published on: December 15, 2023

    Related Experiment Videos

    Last Updated: Jul 1, 2026

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments
    03:31

    End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

    Published on: December 15, 2023

  • Salient objects were defined as those consistently appearing prominent throughout a video, annotated using combined manual masks and multi-subject eye-tracking data.
  • An unsupervised approach employing saliency-guided stacked autoencoders was developed, extracting spatiotemporal cues at pixel, superpixel, and object levels.
  • Main Results:

    • The proposed unsupervised method outperformed 30 out of 31 state-of-the-art models on the new dataset.
    • The dataset proved highly challenging, demonstrating its potential to drive progress in video SOD.
    • The method successfully inferred pixel saliency by encoding spatiotemporal cues from neighbors.

    Conclusions:

    • The developed dataset and unsupervised method represent a significant advancement for video salient object detection.
    • The findings suggest that consistent object prominence is a key factor in defining video saliency.
    • The proposed approach offers a strong baseline and benchmark for future research in unsupervised video SOD.