ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer

2024-08-19 · Code Available

Alik Pramanick, Utsav Bheda, Arijit Sur

Abstract

Recently, transformers have attracted significant interest in single-image super-resolution, demonstrating substantial gains in performance. Current models depend heavily on the network's capacity to extract high-level semantic details from images while overlooking the effective use of multi-scale image details and intermediate information within the network. Furthermore, it has been observed that high-frequency areas in images are considerably harder to super-resolve than low-frequency areas. This work proposes a transformer-based super-resolution architecture called ML-CrAIST that addresses this gap by utilizing low-high frequency information at multiple scales. Unlike most previous work, which applies either spatial or channel attention, we apply both spatial and channel self-attention, which concurrently model pixel interaction along the spatial and channel dimensions and exploit the inherent correlations across the spatial and channel axes. Further, we devise a cross-attention block for super-resolution, which explores the correlations between low- and high-frequency information. Quantitative and qualitative assessments indicate that our proposed ML-CrAIST surpasses state-of-the-art super-resolution methods (e.g., a 0.15 dB gain on Manga109 at ×4). Code is available at: https://github.com/Alik033/ML-CrAIST.
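
The cross-attention idea mentioned in the abstract can be illustrated with a minimal sketch of scaled dot-product cross-attention. This is not the authors' implementation: the function names, the toy feature values, and the choice of high-frequency tokens as queries with low-frequency tokens as keys and values are all assumptions made for illustration, and the learned projection layers a real transformer block would use are omitted.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention on plain Python lists.

    queries: n_q tokens of dimension d
    keys:    n_k tokens of dimension d
    values:  n_k tokens of dimension d_v
    Returns (output, weights): n_q x d_v outputs and n_q x n_k weights.
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    weights = []
    for q in queries:
        # Similarity of this query token to every key token.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights.append(softmax(scores))
    d_v = len(values[0])
    output = [
        [sum(w * v[j] for w, v in zip(row, values)) for j in range(d_v)]
        for row in weights
    ]
    return output, weights


# Hypothetical toy features (assumption): 3 high-frequency tokens attend
# to 2 low-frequency tokens, each of dimension 2.
high_freq = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
low_freq = [[0.5, 0.5], [1.0, -1.0]]

fused, attn = cross_attention(high_freq, low_freq, low_freq)
```

Each row of `attn` is a probability distribution over the low-frequency tokens, so every fused output token is a weighted average of low-frequency values selected by its similarity to a high-frequency query.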

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
| 2x upscaling | ML-CrAIST-Li | #params (K) | 743 | | Unverified |
| 2x upscaling | ML-CrAIST | #params (K) | 1,259 | | Unverified |
| 3x upscaling | ML-CrAIST | #params (K) | 1,268 | | Unverified |
| 3x upscaling | ML-CrAIST-Li | #params (K) | 749 | | Unverified |
| 4x upscaling | ML-CrAIST | #params (K) | 1,280 | | Unverified |
| 4x upscaling | ML-CrAIST-Li | #params (K) | 758 | | Unverified |
| B100 - 2x upscaling | ML-CrAIST | SSIM | 0.9 | | Unverified |
| B100 - 2x upscaling | ML-CrAIST-Li | SSIM | 0.9 | | Unverified |
| B100 - 3x upscaling | ML-CrAIST | SSIM | 0.81 | | Unverified |
| B100 - 3x upscaling | ML-CrAIST-Li | SSIM | 0.81 | | Unverified |
| B100 - 4x upscaling | ML-CrAIST-Li | PSNR | 27.73 | | Unverified |
| B100 - 4x upscaling | ML-CrAIST | PSNR | 27.78 | | Unverified |
| Manga109 - 2x upscaling | ML-CrAIST-Li | PSNR | 39.23 | | Unverified |
| Manga109 - 2x upscaling | ML-CrAIST | PSNR | 39.26 | | Unverified |
| Manga109 - 3x upscaling | ML-CrAIST | PSNR | 34.42 | | Unverified |
| Manga109 - 3x upscaling | ML-CrAIST-Li | PSNR | 34.26 | | Unverified |
| Manga109 - 4x upscaling | ML-CrAIST | SSIM | 0.92 | | Unverified |
| Manga109 - 4x upscaling | ML-CrAIST-Li | SSIM | 0.92 | | Unverified |
| Set14 - 2x upscaling | ML-CrAIST-Li | PSNR | 33.64 | | Unverified |
| Set14 - 2x upscaling | ML-CrAIST | PSNR | 33.77 | | Unverified |
| Set14 - 3x upscaling | ML-CrAIST | PSNR | 30.39 | | Unverified |
| Set14 - 3x upscaling | ML-CrAIST-Li | PSNR | 30.23 | | Unverified |
| Set14 - 4x upscaling | ML-CrAIST-Li | PSNR | 28.4 | | Unverified |
| Set14 - 4x upscaling | ML-CrAIST | PSNR | 28.53 | | Unverified |
| Set5 - 2x upscaling | ML-CrAIST | PSNR | 38.19 | | Unverified |
| Set5 - 2x upscaling | ML-CrAIST-Li | PSNR | 38.15 | | Unverified |
| Set5 - 3x upscaling | ML-CrAIST-Li | PSNR | 34.58 | | Unverified |
| Set5 - 3x upscaling | ML-CrAIST | PSNR | 34.7 | | Unverified |
| Set5 - 4x upscaling | ML-CrAIST | PSNR | 32.36 | | Unverified |
| Set5 - 4x upscaling | ML-CrAIST-Li | PSNR | 32.15 | | Unverified |
| Urban100 - 2x upscaling | ML-CrAIST-Li | PSNR | 32.93 | | Unverified |
| Urban100 - 2x upscaling | ML-CrAIST | PSNR | 33.04 | | Unverified |
| Urban100 - 3x upscaling | ML-CrAIST | PSNR | 28.89 | | Unverified |
| Urban100 - 3x upscaling | ML-CrAIST-Li | PSNR | 28.73 | | Unverified |
| Urban100 - 4x upscaling | ML-CrAIST | PSNR | 26.68 | | Unverified |
| Urban100 - 4x upscaling | ML-CrAIST-Li | PSNR | 26.53 | | Unverified |
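
For readers checking the PSNR figures above, the following is a minimal sketch of the standard PSNR definition, 10·log10(MAX² / MSE), over flat lists of pixel values. The function name `psnr` and the default peak value of 255 are assumptions for illustration. Benchmark protocols often add preprocessing steps (for example colour-space conversion or border cropping) that this page does not state, so the sketch omits them and is not expected to reproduce the claimed numbers on its own.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if not reference or len(reference) != len(estimate):
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical 4-pixel example (assumption): every pixel is off by 5.
reference = [10.0, 20.0, 30.0, 40.0]
estimate = [15.0, 25.0, 35.0, 45.0]
value_db = psnr(reference, estimate)
```

Because PSNR is logarithmic in the mean squared error, a gain of 0.15 dB corresponds to an MSE lower by a factor of 10^0.015 ≈ 1.035, that is, roughly 3.4% less squared error.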
