Human action recognition by semilatent topic models.

Yang Wang¹, Greg Mori

¹Simon Fraser University, Burnaby, Canada. ywang12@cs.sfu.ca

IEEE Transactions on Pattern Analysis and Machine Intelligence

August 22, 2009

Summary

This summary is machine-generated.

We developed new topic models for human action recognition in videos. These models simplify training and improve accuracy by directly linking topics to class labels, outperforming existing methods.

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Human action recognition from video is crucial for applications like surveillance and human-computer interaction.
Existing latent topic models for visual recognition face challenges in training complexity and determining the optimal number of topics.

Purpose of the Study:

To introduce two novel topic models for enhanced human action recognition from video sequences.
To address limitations of previous latent topic models by making the latent topics correspond directly to class labels, so that some variables that were latent in earlier models are observed during training.

Main Methods:

A novel "bag-of-words" representation for video sequences, where each frame is treated as a "word."
Development of two new topic models with direct correspondence between latent topics and class labels.
Decoupling of model parameters to simplify the training process.
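
The ideas in the list above can be illustrated with a minimal sketch. It is not the authors' implementation: the codebook, the toy 2-D frame descriptors, the smoothing constant `alpha`, and every function name are assumptions made for illustration. Each frame is mapped to its nearest codeword (its "word"), and because every topic is tied to a class label, the per-topic word distributions can be estimated by smoothed counting instead of latent-variable inference. Under these simplifications the classifier reduces to a multinomial naive-Bayes-style model with a uniform class prior, which is simpler than the topic models the paper proposes.

```python
import math
from collections import Counter

def nearest_codeword(descriptor, codebook):
    """Index of the codebook entry closest (squared Euclidean) to a frame descriptor."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((d - c) ** 2 for d, c in zip(descriptor, codebook[k])),
    )

def video_to_words(frames, codebook):
    """Bag-of-words view of a video: one codeword index per frame."""
    return [nearest_codeword(frame, codebook) for frame in frames]

def fit_topic_word(videos, labels, num_words, alpha=1.0):
    """Estimate one word distribution per class (topic) by smoothed counting.

    Topics are observed as class labels here, so each distribution is fit
    independently of the others (assumed add-alpha smoothing).
    """
    counts = {}
    for words, label in zip(videos, labels):
        counts.setdefault(label, Counter()).update(words)
    model = {}
    for label, c in counts.items():
        total = sum(c.values()) + alpha * num_words
        model[label] = [(c[w] + alpha) / total for w in range(num_words)]
    return model

def classify(words, model):
    """Pick the class whose word distribution gives the highest log-likelihood."""
    def loglik(label):
        return sum(math.log(model[label][w]) for w in words)
    return max(model, key=loglik)

# Hypothetical toy data: a 3-entry codebook and 2-D frame descriptors.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
walk_frames = [(0.1, 0.0), (0.9, 1.1), (0.0, 0.2)]
run_frames = [(5.1, 4.9), (4.8, 5.2), (1.0, 0.9)]
test_frames = [(4.9, 5.0), (5.2, 5.1)]

train_videos = [video_to_words(v, codebook) for v in (walk_frames, run_frames)]
model = fit_topic_word(train_videos, ["walk", "run"], num_words=len(codebook))
predicted = classify(video_to_words(test_frames, codebook), model)
print(predicted)
```

In this sketch the number of topics is fixed by the number of classes, which mirrors why tying topics to labels removes the need to choose a topic count separately.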

Main Results:

Evaluated action classification on five different datasets.
Obtained results that were comparable to, or significantly better than, previously published results on those datasets.
Validated the effectiveness of utilizing class label information during training.

Conclusions:

The proposed topic models offer a more efficient and effective approach to human action recognition.
Directly incorporating class labels into topic models enhances recognition accuracy and simplifies model selection.
These models represent a significant advancement in the field of video-based action recognition.