Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Relative Motion Analysis using Rotating Axes

Relative Motion Analysis using Rotating Axes

Consider a component AB undergoing a linear motion. Along with a linear motion, point B also rotates around point A. To comprehend this complex movement, position vectors for both points A and B are established using a stationary reference frame.
However, to express the relative position of point B relative to point A, an additional frame of reference, denoted as x'y', is necessary. This additional frame not only translates but also rotates relative to the fixed frame, making it...

Relative Motion Analysis using Rotating Axes-Problem Solving

Relative Motion Analysis using Rotating Axes-Problem Solving

Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Outdoor Motion Capture at Scale.

Sensors (Basel, Switzerland)·2026

Same author

Deep learning-based 2D keypoint detection in alpine ski racing - A performance analysis of state-of-the-art algorithms applied to regular skiing and injury situations.

JSAMS plus·2026

Same author

Comparison of augmented reality glasses for the assistive communication support of hearing loss.

Frontiers in neurology·2025

Same author

Temporally-Consistent Surface Reconstruction Using Metrically-Consistent Atlases.

IEEE transactions on pattern analysis and machine intelligence·2025

Same author

Harnessing deep learning to detect bronchiolitis obliterans syndrome from chest CT.

Communications medicine·2025

Same author

<i>MedShapeNet</i> - a large-scale dataset of 3D medical shapes for computer vision.

Biomedizinische Technik. Biomedical engineering·2024

Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 25, 2025

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

Temporal Representation Learning on Monocular Videos for 3D Human Pose Estimation.

Sina Honari, Victor Constantin, Helge Rhodin

IEEE Transactions on Pattern Analysis and Machine Intelligence

|October 17, 2022

Summary

This summary is machine-generated.

This study introduces a new unsupervised method for extracting temporal information from videos using contrastive self-supervised learning. The technique significantly improves human pose estimation accuracy, outperforming existing methods.

More Related Videos

Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring

Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring

Published on: July 14, 2023

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

Published on: November 1, 2024

Related Experiment Videos

Last Updated: Aug 25, 2025

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring

Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring

Published on: July 14, 2023

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

Published on: November 1, 2024

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Extracting temporal information from monocular videos is challenging.
Existing contrastive self-supervised learning (CSS) methods often treat temporal features simplistically.
Accurate human pose estimation requires robust temporal feature extraction.

Purpose of the Study:

To propose an unsupervised feature extraction method for capturing temporal information in monocular videos.
To improve human pose estimation using enhanced temporal feature extraction.
To develop a novel CSS approach that disentangles time-variant and time-invariant features.

Main Methods:

Detecting and encoding subjects of interest in each video frame.
Leveraging contrastive self-supervised learning (CSS) with disentangled latent vectors.
Applying contrastive loss to time-variant features with gradual transition objectives.
Incorporating input reconstruction for enhanced feature learning.

Main Results:

Reduced human pose estimation error by approximately 50% compared to standard CSS methods.
Outperformed other unsupervised single-view methods for temporal feature extraction.
Achieved performance comparable to multi-view techniques in pose estimation.
Significantly improved 3D pose estimation accuracy when 2D pose information was available.

Conclusions:

The proposed method effectively captures rich temporal features from monocular videos.
Disentangling time-variant and time-invariant components enhances feature representation.
This approach offers a powerful unsupervised solution for human pose estimation and related tasks.