SOTAVerified

Auto-Encoding Score Distribution Regression for Action Quality Assessment

2021-11-22Code Available1· sign in to hype

Boyu Zhang, Jiayuan Chen, Yinfei Xu, HUI ZHANG, Xu Yang, Xin Geng

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

The action quality assessment (AQA) of videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, AQA has been widely studied in the literature. Traditionally, AQA is treated as a regression problem to learn the underlying mappings between videos and action scores. But previous methods ignored data uncertainty in AQA dataset. To address aleatoric uncertainty, we further develop a plug-and-play module Distribution Auto-Encoder (DAE). Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a likelihood loss is used to learn the uncertainty parameters. We plug our DAE approach into MUSDL and CoRe. Experimental results on public datasets demonstrate that our method achieves state-of-the-art on AQA-7, MTL-AQA, and JIGSAWS datasets. Our code is available at https://github.com/InfoX-SEU/DAE-AQA.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
AQA-7DAE-CoReSpearman Correlation85.2Unverified
AQA-7DAE-MLPSpearman Correlation82.58Unverified
JIGSAWSDAE-MTSpearman Correlation0.76Unverified
JIGSAWSDAE-CoReSpearman Correlation0.86Unverified
JIGSAWSDAE-MLPSpearman Correlation0.72Unverified
MTL-AQADAE-MTSpearman Correlation94.52Unverified
MTL-AQADAE-MLPSpearman Correlation92.31Unverified
MTL-AQADAE-CoReSpearman Correlation95.89Unverified

Reproductions