Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

New fused tetracyclic ring systems: synthesis of 1-thia-4<i>a</i>,11,12-triazatetracenes <i>via</i> a multicomponent strategy, 2D-HMBC structural confirmation, and antimicrobial activity.

RSC advances·2026
Same author

Veterinary fracture diagnosis: a deep learning model for dogs long bone fractures.

Scientific reports·2026
Same author

A deep residual 1D-CNN with self-attention for fraud transaction detection in virtual economies.

Scientific reports·2026
Same author

Enhanced YOLO12 with spatial pyramid pooling for real-time cotton insect detection.

Scientific reports·2026
Same author

Multi-Classification of Drug-Drug interaction based on a complete graph convolutional neural network and explainable artificial intelligence.

Journal of bioinformatics and computational biology·2025
Same author

Explainable multi stream deep learning for fine grained camel breed classification using a Novel Arabian and Non Arabian dataset.

Scientific reports·2025
Same journal

Dynamic analysis and reliable mechanical optimization application of ring HNN effected with a memristive neuron.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

DAFF-Net: A detection and search method for small-scale low surface brightness galaxies.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Quasi-synchronization for complex networks with hybrid pinning intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Physics-encoded convolutional neural operators for parametric PDEs: A convergence-guaranteed framework via pre-computed kernel fields.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Dec 20, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

926

Real-time multiple spatiotemporal action localization and prediction approach using deep learning.

Ahmed Ali Hammam1, Mona M Soliman1, Aboul Ella Hassanien2

  • 1Faculty of Computers and Artificial Intelligence, Cairo University, Egypt; Member of Scientific Research Group in Egypt (SRGE), Egypt.

Neural Networks : the Official Journal of the International Neural Network Society
|May 30, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a fast deep-learning method for real-time action localization and prediction in videos. It uses convolutional neural networks and a two-stream model for accurate and speedy detection of multiple actions.

Keywords:
Action localizationAction predictionDeep learningOptical flowSpatiotemporalYOLO network

More Related Videos

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.8K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

5.1K

Related Experiment Videos

Last Updated: Dec 20, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

926
A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.8K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

5.1K

Area of Science:

  • Computer Vision
  • Machine Learning
  • Deep Learning

Background:

  • Action localization and prediction in videos is a challenging problem, especially for real-time applications.
  • Existing methods often focus on single actions per frame or operate offline, limiting practical use.
  • Convolutional Neural Networks (ConvNets) excel in image tasks but have limited application in real-time video action analysis.

Purpose of the Study:

  • To develop a fast and accurate deep-learning approach for real-time action localization and prediction.
  • To enable the detection and classification of multiple actions simultaneously within video streams.
  • To improve upon existing methods in terms of both speed and precision for video-based action analysis.

Main Methods:

  • Utilized convolutional neural networks (ConvNets) for action localization and prediction.
  • Employed a two-stream model incorporating appearance and motion detection networks (You Only Look Once - YOLO) on RGB and optical flow frames.
  • Implemented a fusion step to enhance localization accuracy and generated action tubes from frame-level detections.

Main Results:

  • Achieved real-time performance for localizing and predicting multiple actions in videos.
  • Demonstrated superior processing speed and accuracy compared to existing offline and online approaches.
  • Validated the approach on challenging benchmarks like UCF-101-24 and J-HMDB-21.

Conclusions:

  • The proposed deep-learning approach offers a significant advancement in real-time video action localization and prediction.
  • The method provides a robust solution for early action detection and prediction with high performance.
  • This work paves the way for more efficient and accurate video analysis systems in real-world scenarios.