ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer
Alik Pramanick, Utsav Bheda, Arijit Sur
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/alik033/ml-craistOfficialIn paperpytorch★ 5
Abstract
Recently, transformers have captured significant interest in the area of single-image super-resolution tasks, demonstrating substantial gains in performance. Current models heavily depend on the network's extensive ability to extract high-level semantic details from images while overlooking the effective utilization of multi-scale image details and intermediate information within the network. Furthermore, it has been observed that high-frequency areas in images present significant complexity for super-resolution compared to low-frequency areas. This work proposes a transformer-based super-resolution architecture called ML-CrAIST that addresses this gap by utilizing low-high frequency information in multiple scales. Unlike most of the previous work (either spatial or channel), we operate spatial and channel self-attention, which concurrently model pixel interaction from both spatial and channel dimensions, exploiting the inherent correlations across spatial and channel axis. Further, we devise a cross-attention block for super-resolution, which explores the correlations between low and high-frequency information. Quantitative and qualitative assessments indicate that our proposed ML-CrAIST surpasses state-of-the-art super-resolution methods (e.g., 0.15 dB gain @Manga109 4). Code is available on: https://github.com/Alik033/ML-CrAIST.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 2x upscaling | ML-CrAIST-Li | #params (K) | 743 | — | Unverified |
| 2x upscaling | ML-CrAIST | #params (K) | 1,259 | — | Unverified |
| 3x upscaling | ML-CrAIST | #params (K) | 1,268 | — | Unverified |
| 3x upscaling | ML-CrAIST-Li | #params (K) | 749 | — | Unverified |
| 4x upscaling | ML-CrAIST | #params (K) | 1,280 | — | Unverified |
| 4x upscaling | ML-CrAIST-Li | #params (K) | 758 | — | Unverified |
| B100 - 2x upscaling | ML-CrAIST | SSIM | 0.9 | — | Unverified |
| B100 - 2x upscaling | ML-CrAIST-Li | SSIM | 0.9 | — | Unverified |
| B100 - 3x upscaling | ML-CrAIST | SSIM | 0.81 | — | Unverified |
| B100 - 3x upscaling | ML-CrAIST-Li | SSIM | 0.81 | — | Unverified |
| B100 - 4x upscaling | ML-CrAIST-Li | PSNR | 27.73 | — | Unverified |
| B100 - 4x upscaling | ML-CrAIST | PSNR | 27.78 | — | Unverified |
| Manga109 - 2x upscaling | ML-CrAIST-Li | PSNR | 39.23 | — | Unverified |
| Manga109 - 2x upscaling | ML-CrAIST | PSNR | 39.26 | — | Unverified |
| Manga109 - 3x upscaling | ML-CrAIST | PSNR | 34.42 | — | Unverified |
| Manga109 - 3x upscaling | ML-CrAIST-Li | PSNR | 34.26 | — | Unverified |
| Manga109 - 4x upscaling | ML-CrAIST | SSIM | 0.92 | — | Unverified |
| Manga109 - 4x upscaling | ML-CrAIST-Li | SSIM | 0.92 | — | Unverified |
| Set14 - 2x upscaling | ML-CrAIST-Li | PSNR | 33.64 | — | Unverified |
| Set14 - 2x upscaling | ML-CrAIST | PSNR | 33.77 | — | Unverified |
| Set14 - 3x upscaling | ML-CrAIST | PSNR | 30.39 | — | Unverified |
| Set14 - 3x upscaling | ML-CrAIST-Li | PSNR | 30.23 | — | Unverified |
| Set14 - 4x upscaling | ML-CrAIST-Li | PSNR | 28.4 | — | Unverified |
| Set14 - 4x upscaling | ML-CrAIST | PSNR | 28.53 | — | Unverified |
| Set5 - 2x upscaling | ML-CrAIST | PSNR | 38.19 | — | Unverified |
| Set5 - 2x upscaling | ML-CrAIST-Li | PSNR | 38.15 | — | Unverified |
| Set5 - 3x upscaling | ML-CrAIST-Li | PSNR | 34.58 | — | Unverified |
| Set5 - 3x upscaling | ML-CrAIST | PSNR | 34.7 | — | Unverified |
| Set5 - 4x upscaling | ML-CrAIST | PSNR | 32.36 | — | Unverified |
| Set5 - 4x upscaling | ML-CrAIST-Li | PSNR | 32.15 | — | Unverified |
| Urban100 - 2x upscaling | ML-CrAIST-Li | PSNR | 32.93 | — | Unverified |
| Urban100 - 2x upscaling | ML-CrAIST | PSNR | 33.04 | — | Unverified |
| Urban100 - 3x upscaling | ML-CrAIST | PSNR | 28.89 | — | Unverified |
| Urban100 - 3x upscaling | ML-CrAIST-Li | PSNR | 28.73 | — | Unverified |
| Urban100 - 4x upscaling | ML-CrAIST | PSNR | 26.68 | — | Unverified |
| Urban100 - 4x upscaling | ML-CrAIST-Li | PSNR | 26.53 | — | Unverified |