Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Muscle Coordination and Action01:24

Muscle Coordination and Action

2.2K
Muscle coordination is a complex and finely tuned process essential for smooth and purposeful movements like flexion, extension, adduction, abduction, and rotation. The human body orchestrates the actions of various muscles working in concert, each with a specific role. Four functional types describe how muscles work together: agonist, antagonist, synergist, and fixator.
Agonists
Agonist muscles, often called prime movers, are the primary muscles responsible for producing a specific movement....
2.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cardiac 3D Mechanical and Electrical Signal Reconstruction via Defocused Speckle Imaging.

IEEE transactions on bio-medical engineering·2026
Same author

Detecting multiple fiducial markers from a camera seismocardiogram.

Biomedical optics express·2026
Same author

Cardiac 3D Motion Reconstruction Using Dual-Camera Defocused Speckle Imaging With Multi-Scale Amplification.

IEEE journal of biomedical and health informatics·2025
Same author

A Dual Defocused Camera System For Reconstructing the Cardiac Z-axis Vibrations.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Facial Privacy Protection for Remote Photoplethysmography.

IEEE journal of biomedical and health informatics·2025
Same author

Learning Knowledge-Based Prompts for Robust 3D Mask Presentation Attack Detection.

IEEE transactions on pattern analysis and machine intelligence·2025
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Oct 1, 2025

Estimation of Contact Regions Between Hands and Objects During Human Multi-Digit Grasping
09:41

Estimation of Contact Regions Between Hands and Objects During Human Multi-Digit Grasping

Published on: April 21, 2023

1.8K

Constructing Stronger and Faster Baselines for Skeleton-Based Action Recognition.

Yi-Fan Song, Zhang Zhang, Caifeng Shan

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |March 7, 2022
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new, highly efficient model architecture for identifying human actions from skeleton data. By simplifying complex neural networks and using a smart scaling strategy, the researchers created a system that is faster and smaller than existing top-tier models while maintaining high accuracy.

    Keywords:
    deep learningcomputer visionmodel efficiencyneural networks

    Frequently Asked Questions

    More Related Videos

    Subject-specific Musculoskeletal Model for Studying Bone Strain During Dynamic Motion
    09:32

    Subject-specific Musculoskeletal Model for Studying Bone Strain During Dynamic Motion

    Published on: April 11, 2018

    9.8K
    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.5K

    Related Experiment Videos

    Last Updated: Oct 1, 2025

    Estimation of Contact Regions Between Hands and Objects During Human Multi-Digit Grasping
    09:41

    Estimation of Contact Regions Between Hands and Objects During Human Multi-Digit Grasping

    Published on: April 21, 2023

    1.8K
    Subject-specific Musculoskeletal Model for Studying Bone Strain During Dynamic Motion
    09:32

    Subject-specific Musculoskeletal Model for Studying Bone Strain During Dynamic Motion

    Published on: April 11, 2018

    9.8K
    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.5K

    Area of Science:

    • Computer vision research within skeleton-based action recognition
    • Artificial intelligence and deep learning methodology

    Background:

    Current methods for interpreting human movement from skeletal data often rely on overly complex architectures. These sophisticated models frequently suffer from excessive parameter counts that hinder practical deployment. High computational demands lead to significant challenges during the training and validation phases on massive datasets. No prior work had resolved the trade-off between model performance and operational efficiency. Researchers have struggled to balance high recognition accuracy with the need for lightweight, rapid processing. This gap motivated the development of streamlined alternatives to existing state-of-the-art frameworks. That uncertainty drove the exploration of more compact network designs for spatial-temporal analysis. The field required a shift toward architectures that prioritize both speed and resource management.

    Purpose Of The Study:

    The study aims to develop a more efficient baseline for skeleton-based action recognition. Researchers sought to address the excessive complexity found in recent state-of-the-art models. The primary motivation was to reduce the high validation costs associated with training large-scale architectures. They identified a need for models that extract discriminative features without relying on over-parameterized structures. This work investigates the integration of separable convolutional layers to improve computational efficiency. The team also intended to create a scalable framework that allows for flexible model expansion. By focusing on both speed and accuracy, they aimed to provide a practical alternative for researchers. The project seeks to establish a new standard for lightweight and high-performing skeleton analysis systems.

    Main Methods:

    The review approach involved designing an efficient baseline by embedding separable convolutional layers into a Multiple Input Branches network. Researchers implemented a compound scaling strategy to adjust the depth and width of the model simultaneously. They utilized the PyTorch library to construct and evaluate these architectures. The team tested their models on two major datasets, specifically NTU RGB+D 60 and 120. This design process focused on minimizing the number of trainable parameters while preserving high recognition precision. The investigators compared their results against established state-of-the-art methods to validate performance gains. They ensured that the inference speed was measured alongside accuracy to demonstrate practical utility. This systematic evaluation provided a clear comparison between the new baseline and existing complex systems.

    Main Results:

    The EfficientGCN-B4 baseline achieved an accuracy of 92.1% on the cross-subject benchmark of the NTU 60 dataset. This model proved to be 5.82 times smaller than the MS-G3D framework. Furthermore, the proposed system demonstrated a 5.85 times increase in inference speed compared to the same reference model. These metrics confirm that the new architecture maintains high performance while significantly reducing computational overhead. The researchers successfully generated a family of models, labeled EfficientGCN-Bx, through their synchronous scaling technique. Each variant in this family provides a balance between parameter count and recognition capability. The results indicate that the new approach consistently outperforms previous state-of-the-art methods on large-scale data. This evidence supports the efficacy of the proposed design for skeleton-based tasks.

    Conclusions:

    The authors demonstrate that their proposed architecture achieves superior performance compared to existing complex models. Their scaling strategy allows for the synchronous expansion of network depth and width. This approach results in a family of models that maintain high accuracy with fewer parameters. The EfficientGCN-B4 variant specifically outperforms established benchmarks on large-scale datasets. These findings suggest that simpler, well-designed networks can surpass the capabilities of over-parameterized systems. The study provides a practical path for reducing validation costs in action recognition tasks. The researchers confirm that their framework offers significant improvements in both model size and inference speed. These results highlight the potential for efficient graph-based learning in computer vision applications.

    The researchers propose an early fused Multiple Input Branches network that integrates separable convolutional layers. This mechanism improves feature extraction efficiency compared to traditional, highly complex models that rely on over-parameterized layers.

    The authors utilize a compound scaling strategy to expand the model's width and depth synchronously. This approach differs from standard methods that often scale only one dimension, allowing for a more balanced increase in capacity.

    Separable convolutional layers are necessary to reduce the total number of trainable parameters. These layers provide a more efficient alternative to standard convolutions, which are often computationally expensive in large-scale skeleton datasets.

    The NTU RGB+D 60 and 120 datasets serve as the primary benchmarks. These large-scale collections allow the researchers to compare their model's performance against existing state-of-the-art methods under rigorous cross-subject conditions.

    The EfficientGCN-B4 baseline achieves 92.1% accuracy on the cross-subject benchmark. This measurement demonstrates higher precision than the MS-G3D model, which serves as the primary point of comparison for accuracy and speed.

    The authors suggest that their framework significantly lowers validation costs for future architectural research. They propose that this efficiency gain allows for faster iteration when testing new designs on massive datasets.