Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, LiMin Wang
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/mcg-nju/ema-vfiOfficialIn paperpytorch★ 493
Abstract
Effectively extracting inter-frame motion and appearance information is important for video frame interpolation (VFI). Previous works either extract both types of information in a mixed way or elaborate separate modules for each type of information, which lead to representation ambiguity and low efficiency. In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation. Specifically, we rethink the information process in inter-frame attention and reuse its attention map for both appearance feature enhancement and motion information extraction. Furthermore, for efficient VFI, our proposed module could be seamlessly integrated into a hybrid CNN and Transformer architecture. This hybrid pipeline can alleviate the computational complexity of inter-frame attention as well as preserve detailed low-level structure information. Experimental results demonstrate that, for both fixed- and arbitrary-timestep interpolation, our method achieves state-of-the-art performance on various datasets. Meanwhile, our approach enjoys a lighter computation overhead over models with close performance. The source code and models are available at https://github.com/MCG-NJU/EMA-VFI.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| MSU Video Frame Interpolation | EMA-VFI | PSNR | 29.89 | — | Unverified |
| SNU-FILM (easy) | EMA-VFI | PSNR | 39.98 | — | Unverified |
| SNU-FILM (extreme) | EMA-VFI | PSNR | 25.69 | — | Unverified |
| SNU-FILM (hard) | EMA-VFI | PSNR | 30.94 | — | Unverified |
| SNU-FILM (medium) | EMA-VFI | PSNR | 36.09 | — | Unverified |
| UCF101 | EMA-VFI | PSNR | 35.48 | — | Unverified |
| Vimeo90K | EMA-VFI | PSNR | 36.64 | — | Unverified |
| X4K1000FPS | EMA-VFI | PSNR | 31.46 | — | Unverified |
| X4K1000FPS-2K | EMA-VFI | PSNR | 32.85 | — | Unverified |
| Xiph-2K | EMA-VFI | PSNR | 36.9 | — | Unverified |
| Xiph-4k | EMA-VFI | PSNR | 34.67 | — | Unverified |