Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Relative Motion Analysis using Rotating Axes01:25

Relative Motion Analysis using Rotating Axes

512
Consider a component AB undergoing a linear motion. Along with a linear motion, point B also rotates around point A. To comprehend this complex movement, position vectors for both points A and B are established using a stationary reference frame.
However, to express the relative position of point B relative to point A, an additional frame of reference, denoted as x'y', is necessary. This additional frame not only translates but also rotates relative to the fixed frame, making it...
512
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

440
Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
440
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

840
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
840

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Outdoor Motion Capture at Scale.

Sensors (Basel, Switzerland)·2026
Same author

Deep learning-based 2D keypoint detection in alpine ski racing - A performance analysis of state-of-the-art algorithms applied to regular skiing and injury situations.

JSAMS plus·2026
Same author

Comparison of augmented reality glasses for the assistive communication support of hearing loss.

Frontiers in neurology·2025
Same author

Temporally-Consistent Surface Reconstruction Using Metrically-Consistent Atlases.

IEEE transactions on pattern analysis and machine intelligence·2025
Same author

Harnessing deep learning to detect bronchiolitis obliterans syndrome from chest CT.

Communications medicine·2025
Same author

<i>MedShapeNet</i> - a large-scale dataset of 3D medical shapes for computer vision.

Biomedizinische Technik. Biomedical engineering·2024
Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Aug 25, 2025

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects
06:36

Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

Published on: October 18, 2024

1.1K

Temporal Representation Learning on Monocular Videos for 3D Human Pose Estimation.

Sina Honari, Victor Constantin, Helge Rhodin

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |October 17, 2022
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new unsupervised method for extracting temporal information from videos using contrastive self-supervised learning. The technique significantly improves human pose estimation accuracy, outperforming existing methods.

    More Related Videos

    Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring
    06:32

    Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring

    Published on: July 14, 2023

    1.4K
    Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders
    05:49

    Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

    Published on: November 1, 2024

    891

    Related Experiment Videos

    Last Updated: Aug 25, 2025

    Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects
    06:36

    Author Spotlight: Insights into the Analysis of Human Interaction with 3D Virtual Objects

    Published on: October 18, 2024

    1.1K
    Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring
    06:32

    Author Spotlight: Automated Deep Brain Stimulation for Parkinson's Disease - Exploring the Possibilities and Challenges of Home Monitoring

    Published on: July 14, 2023

    1.4K
    Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders
    05:49

    Author Spotlight: Deciphering Electrical Networks Behind Complex Brain Activities and Disorders

    Published on: November 1, 2024

    891

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Extracting temporal information from monocular videos is challenging.
    • Existing contrastive self-supervised learning (CSS) methods often treat temporal features simplistically.
    • Accurate human pose estimation requires robust temporal feature extraction.

    Purpose of the Study:

    • To propose an unsupervised feature extraction method for capturing temporal information in monocular videos.
    • To improve human pose estimation using enhanced temporal feature extraction.
    • To develop a novel CSS approach that disentangles time-variant and time-invariant features.

    Main Methods:

    • Detecting and encoding subjects of interest in each video frame.
    • Leveraging contrastive self-supervised learning (CSS) with disentangled latent vectors.
    • Applying contrastive loss to time-variant features with gradual transition objectives.
    • Incorporating input reconstruction for enhanced feature learning.

    Main Results:

    • Reduced human pose estimation error by approximately 50% compared to standard CSS methods.
    • Outperformed other unsupervised single-view methods for temporal feature extraction.
    • Achieved performance comparable to multi-view techniques in pose estimation.
    • Significantly improved 3D pose estimation accuracy when 2D pose information was available.

    Conclusions:

    • The proposed method effectively captures rich temporal features from monocular videos.
    • Disentangling time-variant and time-invariant components enhances feature representation.
    • This approach offers a powerful unsupervised solution for human pose estimation and related tasks.