SOTAVerified

MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly Detection

2024-03-21CVPR 2024Code Available2· sign in to hype

Jakub Micorek, Horst Possegger, Dominik Narnhofer, Horst Bischof, Mateusz Kozinski

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We propose a novel approach to video anomaly detection: we treat feature vectors extracted from videos as realizations of a random variable with a fixed distribution and model this distribution with a neural network. This lets us estimate the likelihood of test videos and detect video anomalies by thresholding the likelihood estimates. We train our video anomaly detector using a modification of denoising score matching, a method that injects training data with noise to facilitate modeling its distribution. To eliminate hyperparameter selection, we model the distribution of noisy video features across a range of noise levels and introduce a regularizer that tends to align the models for different levels of noise. At test time, we combine anomaly indications at multiple noise scales with a Gaussian mixture model. Running our video anomaly detector induces minimal delays as inference requires merely extracting the features and forward-propagating them through a shallow neural network and a Gaussian mixture model. Our experiments on five popular video anomaly detection benchmarks demonstrate state-of-the-art performance, both in the object-centric and in the frame-centric setup.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CUHK AvenueMULDE-object-centric-microAUC94.3Unverified
ShanghaiTechMULDE-object-centric-microAUC86.7Unverified
ShanghaiTechMULDE-frame-centric-microAUC81.3Unverified
UBnormalMULDE-frame-centric-micro-one-class-classificationAUC72.8Unverified
UCF-CrimeMULDE-frame-centric-micro-one-class-classificationAUC78.5Unverified
UCSD Ped2MULDE-object-centric-microAUC99.7Unverified

Reproductions