Learning Parallax Attention for Stereo Image Super-Resolution
Longguang Wang, Yingqian Wang, Zhengfa Liang, Zaiping Lin, Jungang Yang, Wei An, Yulan Guo
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/LongguangWang/PASSRnetOfficialpytorch★ 0
Abstract
Stereo image pairs can be used to improve the performance of super-resolution (SR) since additional information is provided from a second viewpoint. However, it is challenging to incorporate this information for SR since disparities between stereo images vary significantly. In this paper, we propose a parallax-attention stereo superresolution network (PASSRnet) to integrate the information from a stereo image pair for SR. Specifically, we introduce a parallax-attention mechanism with a global receptive field along the epipolar line to handle different stereo images with large disparity variations. We also propose a new and the largest dataset for stereo image SR (namely, Flickr1024). Extensive experiments demonstrate that the parallax-attention mechanism can capture correspondence between stereo images to improve SR performance with a small computational and memory cost. Comparative results show that our PASSRnet achieves the state-of-the-art performance on the Middlebury, KITTI 2012 and KITTI 2015 datasets.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| KITTI 2012 - 2x upscaling | PASSRnet | PSNR | 30.65 | — | Unverified |
| KITTI 2012 - 4x upscaling | PASSRnet | PSNR | 26.26 | — | Unverified |
| KITTI 2015 - 2x upscaling | PASSRnet | PSNR | 29.78 | — | Unverified |
| KITTI 2015 - 4x upscaling | PASSRnet | PSNR | 25.43 | — | Unverified |
| Middlebury - 2x upscaling | PASSRnet | PSNR | 34.05 | — | Unverified |
| Middlebury - 4x upscaling | PASSRnet | PSNR | 28.63 | — | Unverified |