Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Tracking people by learning their appearance.

Deva Ramanan1, David A Forsyth, Andrew Zisserman

  • 1ramanan@tti-c.org

IEEE Transactions on Pattern Analysis and Machine Intelligence
|November 17, 2006
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Sound of Water: Inferring Physical Properties from Pouring Liquids.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Identifying scoliosis in a population-based adult cohort: automation of a validated method based on total body dual energy X-ray absorptiometry scans.

European spine journal : official publication of the European Spine Society, the European Spinal Deformity Society, and the European Section of the Cervical Spine Research Society·2026
Same author

Detect+Track: robust and flexible software tools for improved tracking and behavioural analysis of fish.

Royal Society open science·2025
Same author

EPIC-SOUNDS: A Large-Scale Dataset of Actions That Sound.

IEEE transactions on pattern analysis and machine intelligence·2025
Same author

Automated detection of spinal bone marrow oedema in axial spondyloarthritis: training and validation using two large phase 3 trial datasets.

Rheumatology (Oxford, England)·2025
Same author

Audio-visual modelling in a clinical setting.

Scientific reports·2024
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

This study introduces an automatic system for tracking people in videos, counting individuals, and estimating their body configurations. The method excels even with occlusions and unpredictable movements, offering robust human pose estimation.

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Human Motion Analysis

Background:

  • Automatic human tracking in videos is challenging due to unpredictable motion, varied appearances, and cluttered backgrounds.
  • Estimating the number and configurations of people in each frame requires robust detection and localization of body parts.
  • Existing methods often struggle with occlusions, rapid movements, and distinguishing people from similar-looking background elements.

Purpose of the Study:

  • To develop a fully automatic system for tracking human articulations in video sequences.
  • To accurately count distinct individuals and estimate their body configurations.
  • To create a robust tracking system that can recover from occlusions and temporary view loss.

Main Methods:

  • A two-stage approach: first, building appearance models for each person, then detecting these models in subsequent frames ('tracking by model-building and detection').

Related Experiment Videos

  • Two model-building algorithms: a bottom-up approach grouping candidate body parts and a top-down approach detecting key poses.
  • Utilizing discriminative appearance models that leverage background structure without requiring background subtraction.
  • Main Results:

    • The system successfully counts distinct individuals and tracks them throughout video sequences.
    • Accurate identification and tracking capabilities were demonstrated, including recovery from occlusions and brief disappearances.
    • Precise body configuration estimation was achieved, independent of specific human motion models.

    Conclusions:

    • The developed 'tracking by model-building and detection' system provides a robust solution for automatic human tracking and pose estimation.
    • The approach is effective across diverse scenarios, including unscripted indoor/outdoor activities, films, and sports footage.
    • Discriminative appearance modeling enhances tracking performance by exploiting background context.