The power of video-level classification in almost all previous video anomaly detection is either overlooked or not studied well explicitly. With addition of a BERT or LSTM video classification, we achieve new SOTA results on UCF-Crime, ShanghaiTech, and XD-Violence datasts.
Standard MIL with BERT video classification
Modificed RTFM with BERT video classification