Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Buoyancy-Dependent Flow Generation by Collectively Migrating Swimmers.

Integrative and comparative biology·2026

Same author

Biohybrid Robotic Jellyfish for Swimming-Enhanced Vertical Ocean Profiling.

Biomimetics (Basel, Switzerland)·2026

Same author

Digital Volume Correlation Challenge 2.0: A Comprehensive Dataset for Digital Volume Correlation Benchmarking.

Research square·2026

Same author

Top-down regulation of ingestive behavior fragmentation.

bioRxiv : the preprint server for biology·2026

Same author

Evolutionary basis of male same-sex sexual behavior by multiple pheromone switches in Drosophila.

Current biology : CB·2026

Same author

Verifiably stable nonlinear control with reinforcement-learned diffractive optical networks.

Optics express·2026

Same journal

CARL: A Framework for Equivariant Image Registration.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026

Same journal

Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026

Same journal

Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026

Same journal

The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026

Same journal

Perceptual Inductive Bias Is What You Need Before Contrastive Learning.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026

Same journal

MultiMorph: On-demand Atlas Construction.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 14, 2025

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Self-Supervised Keypoint Discovery in Behavioral Videos.

Jennifer J Sun¹, Serim Ryou¹, Roni H Goldshmid¹

¹Caltech.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition

|January 11, 2023

Summary

This summary is machine-generated.

We developed Behavioral Keypoint Discovery (B-KinD), a self-supervised method to learn agent posture and structure from unlabeled videos. This approach identifies meaningful body parts, achieving state-of-the-art results in keypoint regression and reducing training costs.

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

Published on: June 15, 2020

Related Experiment Videos

Last Updated: Aug 14, 2025

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

Published on: June 15, 2020

Area of Science:

Computer Vision
Machine Learning
Bioinformatics

Background:

Learning agent posture and structure from unlabeled videos is challenging.
Existing methods often require manual annotations, increasing costs and time.
Agent movement is a primary source of information in behavioral videos.

Purpose of the Study:

To propose a novel self-supervised method for discovering keypoints representing agent posture and structure from unlabeled behavioral videos.
To demonstrate the generality and effectiveness of the proposed method across diverse agent types.
To reduce the reliance on manual annotations in training machine learning models for behavioral analysis.

Main Methods:

Behavioral Keypoint Discovery (B-KinD) utilizes an encoder-decoder architecture with a geometric bottleneck.
The method reconstructs the spatiotemporal difference between video frames, focusing on regions of movement.
It operates directly on input videos without requiring manual annotations.

Main Results:

B-KinD successfully discovers semantically meaningful keypoints across various agents (mouse, fly, human, jellyfish, trees).
The method achieves state-of-the-art performance in keypoint regression among self-supervised approaches.
Discovered keypoints perform comparably to supervised keypoints in downstream tasks like behavior classification.

Conclusions:

B-KinD offers a powerful self-supervised approach for extracting structural and postural information from unlabeled behavioral videos.
The method significantly reduces the cost and effort associated with model training compared to supervised methods.
This technique has broad applicability in analyzing animal behavior, biomechanics, and plant movement.