Real-time multiple spatiotemporal action localization and prediction approach using deep learning.

Ahmed Ali Hammam¹, Mona M Soliman¹, Aboul Ella Hassanien²

¹Faculty of Computers and Artificial Intelligence, Cairo University, Egypt; Member of Scientific Research Group in Egypt (SRGE), Egypt.

Neural Networks: the Official Journal of the International Neural Network Society

May 30, 2020

Summary

This summary is machine-generated.

This study introduces a fast deep-learning method for real-time action localization and prediction in videos. It uses convolutional neural networks and a two-stream model for accurate and speedy detection of multiple actions.

Keywords:

Action localization; Action prediction; Deep learning; Optical flow; Spatiotemporal; YOLO network

Area of Science:

Computer Vision
Machine Learning
Deep Learning

Background:

Localizing and predicting actions in videos is challenging, especially under real-time constraints.
Existing methods often handle only a single action per frame or operate offline, which limits their practical use.
Convolutional Neural Networks (ConvNets) excel at image tasks but have seen limited use in real-time video action analysis.

Purpose of the Study:

To develop a fast and accurate deep-learning approach for real-time action localization and prediction.
To enable the detection and classification of multiple actions simultaneously within video streams.
To improve upon existing methods in terms of both speed and precision for video-based action analysis.

Main Methods:

Used convolutional neural networks (ConvNets) for action localization and prediction.
Employed a two-stream model in which appearance and motion detection networks, both based on You Only Look Once (YOLO), run on RGB frames and optical flow frames, respectively.
Applied a fusion step to improve localization accuracy, then generated action tubes from the frame-level detections.
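
The fusion and tube-generation steps above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not state the exact fusion rule or linking algorithm, so the score-boosting rule, the greedy linking, the thresholds, and all names (`Detection`, `fuse_streams`, `link_tubes`) are assumptions made for illustration. The detector outputs are taken as given lists of boxes, labels, and scores.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


@dataclass
class Detection:
    box: Box
    label: str
    score: float


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def fuse_streams(appearance: List[Detection], motion: List[Detection],
                 iou_thr: float = 0.5, boost: float = 0.5) -> List[Detection]:
    """Assumed fusion rule: raise the score of each appearance (RGB)
    detection using the best-overlapping motion (optical flow) detection
    that carries the same action label."""
    fused = []
    for det in appearance:
        support = 0.0
        for m in motion:
            overlap = iou(det.box, m.box)
            if m.label == det.label and overlap >= iou_thr:
                support = max(support, m.score * overlap)
        new_score = min(1.0, det.score + boost * support)
        fused.append(Detection(det.box, det.label, new_score))
    return fused


def link_tubes(frames: List[List[Detection]],
               iou_thr: float = 0.3) -> List[List[Tuple[int, Detection]]]:
    """Assumed greedy online linking: each tube that reached the previous
    frame is extended with the highest-scoring same-label detection that
    overlaps its last box; unmatched detections start new tubes."""
    tubes: List[List[Tuple[int, Detection]]] = []
    for t, dets in enumerate(frames):
        unused = list(dets)
        for tube in tubes:
            last_t, last = tube[-1]
            if last_t != t - 1:
                continue  # tube already ended in an earlier frame
            candidates = [d for d in unused
                          if d.label == last.label
                          and iou(d.box, last.box) >= iou_thr]
            if candidates:
                best = max(candidates, key=lambda d: d.score)
                tube.append((t, best))
                unused.remove(best)
        tubes.extend([(t, d)] for d in unused)
    return tubes


# Two frames containing two simultaneous actions.
frames = [
    [Detection((0, 0, 10, 10), "run", 0.9), Detection((50, 50, 60, 60), "jump", 0.8)],
    [Detection((1, 1, 11, 11), "run", 0.9), Detection((51, 50, 61, 60), "jump", 0.7)],
]
tubes = link_tubes(frames)
```

Because the linking only looks at the previous frame, it can run online as frames arrive, which matches the real-time setting described here; a production system would likely also tolerate short gaps and apply non-maximum suppression before linking.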

Main Results:

Achieved real-time performance for localizing and predicting multiple actions in videos.
Demonstrated superior processing speed and accuracy compared to existing offline and online approaches.
Validated the approach on challenging benchmarks such as UCF-101-24 and J-HMDB-21.

Conclusions:

The proposed deep-learning approach offers a significant advancement in real-time video action localization and prediction.
The method provides a robust solution for early action detection and prediction with high performance.
This work paves the way for more efficient and accurate video analysis systems in real-world scenarios.