Self-Supervised Video-Based Action Recognition With Disturbances.

Wei Lin, Xinghao Ding, Yue Huang

IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

April 26, 2023

Summary

This summary is machine-generated.

This study introduces VARD, a novel self-supervised method for video action recognition. VARD enhances action representation by focusing on core visual and semantic information, outperforming existing approaches without needing complex data like optical flow.

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Self-supervised video-based action recognition requires extracting key action information from diverse, unlabeled videos.
Existing methods often prioritize visual spatio-temporal features, neglecting semantic information crucial for human-like cognition.
Action recognition is challenged by variations in actors, scenes, and semantic encoding.

Purpose of the Study:

To propose VARD (Video-based Action Recognition with Disturbances), a novel self-supervised method for robust video action recognition.
To extract principal action information by integrating both visual and semantic attributes, inspired by human cognitive processes.
To develop a method that focuses on essential action characteristics by minimizing the impact of inconsequential visual and semantic variations.

Main Methods:

For each action video, VARD constructs a 'positive' clip or embedding that is visually or semantically disturbed relative to the original.
The method aims to minimize the distance between the original and disturbed representations in the latent space.
It leverages Video Disturbance and Embedding Disturbance techniques to achieve this objective.
Notably, VARD does not require optical flow, negative samples, or pretext tasks.
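
The objective described above can be illustrated with a minimal sketch of a positive-only loss. This is not the authors' implementation: the cosine distance, the Gaussian-noise embedding disturbance, and every function name below are assumptions made for illustration, since the summary does not specify the distance measure or how the disturbances are generated.

```python
import math
import random


def l2_normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def positive_pair_distance(z_orig, z_dist):
    """Cosine distance between two embeddings (assumed metric).

    0 means the embeddings point the same way, 2 means opposite directions.
    """
    a = l2_normalize(z_orig)
    b = l2_normalize(z_dist)
    return 1.0 - sum(x * y for x, y in zip(a, b))


def disturb_embedding(z, rng, scale=0.1):
    """Hypothetical embedding disturbance: add small Gaussian noise."""
    return [x + rng.gauss(0.0, scale) for x in z]


def batch_loss(originals, disturbed):
    """Mean distance over (original, disturbed) pairs; no negatives are used."""
    pairs = list(zip(originals, disturbed))
    return sum(positive_pair_distance(a, b) for a, b in pairs) / len(pairs)
```

In a training loop, a loss of this shape would be computed on encoder outputs and minimized by gradient descent. The sketch only shows that the objective needs one positive pair per video and no negative samples.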

Main Results:

VARD effectively improves upon strong baselines in self-supervised action recognition.
The method demonstrates superior performance compared to multiple classical and advanced self-supervised techniques.
Experiments were conducted on the widely used UCF101 and HMDB51 datasets.

Conclusions:

VARD offers an effective approach to self-supervised video-based action recognition by integrating visual and semantic information.
The proposed disturbance-based method successfully drives networks to focus on principal action information, enhancing robustness.
VARD presents a simplified yet powerful alternative to existing methods, achieving state-of-the-art results without optical flow, negative samples, or pretext tasks.