Multimodal Pathway for Video Recognition

This part of the code is based on VideoMAE. We thank the authors for their outstanding project.

Usage

Install the dependencies:

pip install -r requirements.txt

You can prepare the Kinetics-400 dataset with OpenDataLab by running the commands below:

pip install openxlab
openxlab dataset get --dataset-repo OpenMMLab/Kinetics-400
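After the download finishes, a quick sanity check can confirm that the videos are where you expect them. The following is a minimal sketch, not part of this repository: `DATA_ROOT` and the `count_videos` helper are hypothetical names, and it assumes the dataset has been extracted and the videos are stored as `.mp4` files.

```shell
# DATA_ROOT is a hypothetical location; point it at wherever openxlab saved the dataset.
DATA_ROOT="${DATA_ROOT:-./Kinetics-400}"

# Count the .mp4 files under a directory (assumes videos are stored as .mp4).
count_videos() {
    find "$1" -type f -name '*.mp4' | wc -l | tr -d ' '
}

if [ -d "$DATA_ROOT" ]; then
    echo "Found $(count_videos "$DATA_ROOT") video files under $DATA_ROOT"
else
    echo "Dataset directory $DATA_ROOT not found" >&2
fi
```

If the count is zero, the dataset may still be packed in archives or stored under a different path.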

We provide an experiment script for convenience:

bash run.sh

Please edit run.sh to match your environment before running it.