Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Video

Updated: Jul 1, 2026

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

A Benchmark Dataset and Saliency-Guided Stacked Autoencoders for Video-Based Salient Object Detection.

Jia Li, Changqun Xia, Xiaowu Chen

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

|October 14, 2017

Summary

This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Learning When and How to Update Memory for Video Object Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

Rethinking Lightweight Salient Object Detection via Network Depth-Width Tradeoff.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2023

Same author

Boosting Broader Receptive Fields for Salient Object Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2023

Same author

Salient Object Detection With Purificatory Mechanism and Structural Similarity Loss.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2021

Same author

Semantic Object Segmentation in Tagged Videos via Detection.

IEEE transactions on pattern analysis and machine intelligence·2017

Same author

Transparent metals for ultrabroadband electromagnetic waves.

Advanced materials (Deerfield Beach, Fla.)·2012

Same journal

Hyperbolic Cycle Alignment for Infrared-Visible Image Fusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Learning Gaze Synthesizer via 3D-eye Controlled Diffusion and Cross-domain Feature Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Underlying Semantic Diffusion for Effective and Efficient In-Context Learning.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

DiffRES: Unleashing Text-to-Image Diffusion Models for Generative Referring Expression Segmentation without Information Leakage.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Location Matters: Frequency-Spatial Dual Space Adaptation for Cross-Domain Few-Shot Segmentation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

BayeTopo: Bayesian-based Topology-guided Learning for Vascular Imaging Segmentation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

See all related articles

Researchers developed a new dataset and unsupervised method for video salient object detection (SOD). The approach effectively identifies consistently salient objects across video frames, outperforming most existing models.

Area of Science:

Computer Vision
Machine Learning
Human-Computer Interaction

Background:

Image-based salient object detection (SOD) is well-researched, but video-based SOD lacks large-scale, well-annotated datasets.
Existing video SOD research is limited by the absence of unambiguous definitions and annotations for salient objects in dynamic scenes.

Purpose of the Study:

To introduce the largest dataset to date for video-based salient object detection (SOD).
To propose an unsupervised baseline method for video SOD using saliency-guided stacked autoencoders.
To establish a challenging benchmark for advancing video SOD research.

Main Methods:

A new dataset comprising 200 videos with manually annotated objects/regions on 7650 keyframes and eye-tracking data from 23 subjects was created.

More Related Videos

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

Published on: December 15, 2023

Related Experiment Videos

Last Updated: Jul 1, 2026

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

End-To-End Deep Neural Network for Salient Object Detection in Complex Environments

Published on: December 15, 2023

Salient objects were defined as those consistently appearing prominent throughout a video, annotated using combined manual masks and multi-subject eye-tracking data.

An unsupervised approach employing saliency-guided stacked autoencoders was developed, extracting spatiotemporal cues at pixel, superpixel, and object levels.

Main Results:

The proposed unsupervised method outperformed 30 out of 31 state-of-the-art models on the new dataset.
The dataset proved highly challenging, demonstrating its potential to drive progress in video SOD.
The method successfully inferred pixel saliency by encoding spatiotemporal cues from neighbors.

Conclusions:

The developed dataset and unsupervised method represent a significant advancement for video salient object detection.
The findings suggest that consistent object prominence is a key factor in defining video saliency.
The proposed approach offers a strong baseline and benchmark for future research in unsupervised video SOD.