SOTAVerified

Temporal Modulation Network for Controllable Space-Time Video Super-Resolution

2021-04-21CVPR 2021Code Available1· sign in to hype

Gang Xu, Jun Xu, Zhen Li, Liang Wang, Xing Sun, Ming-Ming Cheng

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Space-time video super-resolution (STVSR) aims to increase the spatial and temporal resolutions of low-resolution and low-frame-rate videos. Recently, deformable convolution based methods have achieved promising STVSR performance, but they could only infer the intermediate frame pre-defined in the training stage. Besides, these methods undervalued the short-term motion cues among adjacent frames. In this paper, we propose a Temporal Modulation Network (TMNet) to interpolate arbitrary intermediate frame(s) with accurate high-resolution reconstruction. Specifically, we propose a Temporal Modulation Block (TMB) to modulate deformable convolution kernels for controllable feature interpolation. To well exploit the temporal information, we propose a Locally-temporal Feature Comparison (LFC) module, along with the Bi-directional Deformable ConvLSTM, to extract short-term and long-term motion cues in videos. Experiments on three benchmark datasets demonstrate that our TMNet outperforms previous STVSR methods. The code is available at https://github.com/CS-GangXu/TMNet.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
MSU Super-Resolution for Video CompressionTMNet + x264BSQ-rate over ERQA1.88Unverified
MSU Super-Resolution for Video CompressionTMNet + uavs3eBSQ-rate over ERQA13.19Unverified
MSU Super-Resolution for Video CompressionTMNet + x265BSQ-rate over ERQA13.58Unverified
MSU Super-Resolution for Video CompressionTMNet + vvencBSQ-rate over ERQA21.3Unverified
MSU Super-Resolution for Video CompressionTMNet + aomencBSQ-rate over ERQA21.8Unverified
MSU Video Super Resolution Benchmark: Detail RestorationTMNetSubjective score6Unverified

Reproductions