Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Buoyancy-Dependent Flow Generation by Collectively Migrating Swimmers.

Integrative and comparative biology·2026
Same author

Biohybrid Robotic Jellyfish for Swimming-Enhanced Vertical Ocean Profiling.

Biomimetics (Basel, Switzerland)·2026
Same author

Digital Volume Correlation Challenge 2.0: A Comprehensive Dataset for Digital Volume Correlation Benchmarking.

Research square·2026
Same author

Top-down regulation of ingestive behavior fragmentation.

bioRxiv : the preprint server for biology·2026
Same author

Evolutionary basis of male same-sex sexual behavior by multiple pheromone switches in Drosophila.

Current biology : CB·2026
Same author

Verifiably stable nonlinear control with reinforcement-learned diffractive optical networks.

Optics express·2026
Same journal

CARL: A Framework for Equivariant Image Registration.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026
Same journal

Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026
Same journal

Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026
Same journal

The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026
Same journal

Perceptual Inductive Bias Is What You Need Before Contrastive Learning.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026
Same journal

MultiMorph: On-demand Atlas Construction.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition·2026
See all related articles

Related Experiment Video

Updated: Aug 14, 2025

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.5K

Self-Supervised Keypoint Discovery in Behavioral Videos.

Jennifer J Sun1, Serim Ryou1, Roni H Goldshmid1

  • 1Caltech.

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition
|January 11, 2023
PubMed
Summary
This summary is machine-generated.

We developed Behavioral Keypoint Discovery (B-KinD), a self-supervised method to learn agent posture and structure from unlabeled videos. This approach identifies meaningful body parts, achieving state-of-the-art results in keypoint regression and reducing training costs.

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K
Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut
08:32

Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

Published on: June 15, 2020

12.6K

Related Experiment Videos

Last Updated: Aug 14, 2025

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.5K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K
Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut
08:32

Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

Published on: June 15, 2020

12.6K

Area of Science:

  • Computer Vision
  • Machine Learning
  • Bioinformatics

Background:

  • Learning agent posture and structure from unlabeled videos is challenging.
  • Existing methods often require manual annotations, increasing costs and time.
  • Agent movement is a primary source of information in behavioral videos.

Purpose of the Study:

  • To propose a novel self-supervised method for discovering keypoints representing agent posture and structure from unlabeled behavioral videos.
  • To demonstrate the generality and effectiveness of the proposed method across diverse agent types.
  • To reduce the reliance on manual annotations in training machine learning models for behavioral analysis.

Main Methods:

  • Behavioral Keypoint Discovery (B-KinD) utilizes an encoder-decoder architecture with a geometric bottleneck.
  • The method reconstructs the spatiotemporal difference between video frames, focusing on regions of movement.
  • It operates directly on input videos without requiring manual annotations.

Main Results:

  • B-KinD successfully discovers semantically meaningful keypoints across various agents (mouse, fly, human, jellyfish, trees).
  • The method achieves state-of-the-art performance in keypoint regression among self-supervised approaches.
  • Discovered keypoints perform comparably to supervised keypoints in downstream tasks like behavior classification.

Conclusions:

  • B-KinD offers a powerful self-supervised approach for extracting structural and postural information from unlabeled behavioral videos.
  • The method significantly reduces the cost and effort associated with model training compared to supervised methods.
  • This technique has broad applicability in analyzing animal behavior, biomechanics, and plant movement.