Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Force Classification01:22

Force Classification

1.3K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.3K
Fixed Action Patterns01:06

Fixed Action Patterns

16.2K
A fixed action pattern (FAP) is a specific, hard-wired sequence of behaviors that occurs in response to an external stimulus, called a sign stimulus. The behavior is “fixed” because it is essentially unchangeable—proceeding similarly across individuals of a species every time it occurs.
16.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Selection, Aggregation, and Enhancement: Trajectory Consistent Diffusion Model for Image Super-Resolution.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

UDD: Unsupervised denoising diffusion for noisy multi-focus image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Exploring Hierarchical Cross-Modal Correlation Consistency for Partial Mismatching.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Enhanced Cross-Modal Hashing via Hybrid Distillation and Structural Refinement.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2025
Same author

Focusing on pedestrians like human for clothes changing person re-identification.

Neural networks : the official journal of the International Neural Network Society·2025
Same author

Memory-augmented shuffled meta learning for visible-infrared person re-identification.

Neural networks : the official journal of the International Neural Network Society·2025
Same journal

Hyperbolic Cycle Alignment for Infrared-Visible Image Fusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Learning Gaze Synthesizer via 3D-eye Controlled Diffusion and Cross-domain Feature Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Underlying Semantic Diffusion for Effective and Efficient In-Context Learning.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

DiffRES: Unleashing Text-to-Image Diffusion Models for Generative Referring Expression Segmentation without Information Leakage.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Location Matters: Frequency-Spatial Dual Space Adaptation for Cross-Domain Few-Shot Segmentation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BayeTopo: Bayesian-based Topology-guided Learning for Vascular Imaging Segmentation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Aug 1, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

3.9K

Self-Supervised Video-Based Action Recognition With Disturbances.

Wei Lin, Xinghao Ding, Yue Huang

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |April 26, 2023
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces VARD, a novel self-supervised method for video action recognition. VARD enhances action representation by focusing on core visual and semantic information, outperforming existing approaches without needing complex data like optical flow.

    More Related Videos

    Corticospinal Excitability Modulation During Action Observation
    12:33

    Corticospinal Excitability Modulation During Action Observation

    Published on: December 31, 2013

    9.0K
    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.0K

    Related Experiment Videos

    Last Updated: Aug 1, 2025

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    3.9K
    Corticospinal Excitability Modulation During Action Observation
    12:33

    Corticospinal Excitability Modulation During Action Observation

    Published on: December 31, 2013

    9.0K
    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.0K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Self-supervised video-based action recognition requires extracting key action information from diverse, unlabeled videos.
    • Existing methods often prioritize visual spatio-temporal features, neglecting semantic information crucial for human-like cognition.
    • Action recognition is challenged by variations in actors, scenes, and semantic encoding.

    Purpose of the Study:

    • To propose VARD (Video-based Action Recognition with Disturbances), a novel self-supervised method for robust video action recognition.
    • To extract principal action information by integrating both visual and semantic attributes, inspired by human cognitive processes.
    • To develop a method that focuses on essential action characteristics by minimizing the impact of inconsequential visual and semantic variations.

    Main Methods:

    • VARD constructs a 'positive' clip/embedding for each action video, which is visually/semantically disturbed compared to the original.
    • The method aims to minimize the distance between the original and disturbed representations in the latent space.
    • It leverages Video Disturbance and Embedding Disturbance techniques to achieve this objective.
    • Notably, VARD does not require optical flow, negative samples, or pretext tasks.

    Main Results:

    • VARD effectively improves upon strong baselines in self-supervised action recognition.
    • The method demonstrates superior performance compared to multiple classical and advanced self-supervised techniques.
    • Experiments were validated on the widely-used UCF101 and HMDB51 datasets.

    Conclusions:

    • VARD offers an effective approach to self-supervised video-based action recognition by integrating visual and semantic information.
    • The proposed disturbance-based method successfully drives networks to focus on principal action information, enhancing robustness.
    • VARD presents a simplified yet powerful alternative to existing methods, achieving state-of-the-art results without complex requirements.