Skip to content

ibraztech2/Action-Localization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎬 Video Action Localization App

Video Action Localization App is a Streamlit-powered web interface that processes video input to detect and localize actions across frames using spatio-temporal deep learning models. It leverages MMAction2’s SlowFast-ACRN model for action recognition and Faster R-CNN for spatial localization.

A complete, Colab-ready solution for spatio-temporal action detection in videos, featuring:

βœ… SlowFast-ACRN for action recognition βœ… Faster R-CNN for human/object detection βœ… Streamlit interface for interactivity βœ… Google Colab backend for GPU-powered inference

πŸš€ Quick Start

βœ… Prerequisites

  • A Google Colab account
  • GPU runtime enabled in Colab (Runtime > Change runtime type > GPU)
  • A free Ngrok account
  • Your personal Ngrok Auth Token

✨ Features

  • πŸ“€ Upload .mp4, .avi, or .mov video files
  • 🧠 Perform frame-wise action localization with SlowFast-ACRN
  • πŸ“₯ Download annotated output video with bounding boxes and action labels

🧠 Model Architecture

Component Description Backbone SlowFast Network with ACRN Head Detector Faster R-CNN (ResNet-50-FPN) Trained On Kinetics-400 Dataset

πŸ› οΈ Usage

This app is designed to run entirely in Google Colab, using GPU resources for efficient inference, while Streamlit provides the user interface.

  • Run the notebook sequentially from top to bottom
  • You may be prompted to restart runtime after dependency installations
  • Once Ngrok tunnel is active, you'll get a public URL to access your Streamlit app

πŸ“ Project Structure

β”œβ”€β”€ Action_Localization.ipynb # πŸš€ Main Colab notebook for setup & execution β”œβ”€β”€ app.py # πŸ’» Streamlit app logic β”œβ”€β”€ checkpoints/ # 🧠 Model weight files β”‚ β”œβ”€β”€ slowfast-acrn.pth # - Action recognition weights β”‚ └── faster_rcnn.pth # - Object detection weights β”œβ”€β”€ mmaction2/ # πŸ“¦ MMAction2 framework (cloned repo) β”‚ β”œβ”€β”€ configs/ # - Model configuration files β”‚ β”œβ”€β”€ demo/ # - Sample media for testing └── requirements.txt # πŸ“Œ Python dependencies

Google colab

πŸ‘€ Author Ibrahim Mustapha Mechatronics Engineering Student | Computer Vision & AI Enthusiast πŸ”— GitHub πŸ”— linkedin πŸ“§ Email

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published