SOTAVerified

Audio Super-Resolution

Audio super-resolution, especially speech, refers to the process of reconstructing high-resolution music signals from their low-resolution counterparts. Essentially, it enhances the quality of a speech signal by increasing its sampling rate or bandwidth while preserving naturalness and intelligibility. A representative Github project for speech super-resolution is ClearerVoice-Studio.

Papers

Showing 2122 of 22 papers

TitleStatusHype
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise ModulationsCode0
Audio Super Resolution using Neural NetworksCode0
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1U-NetLog-Spectral Distance3.1Unverified
2U-Net + TFiLMLog-Spectral Distance1.8Unverified
3U-Net + AFiLMLog-Spectral Distance1.7Unverified
4TUNetLog-Spectral Distance1.36Unverified
5TUNet + MSM pre-trainingLog-Spectral Distance1.28Unverified
6NVSRLog-Spectral Distance0.78Unverified
7CMGANLog-Spectral Distance0.76Unverified
#ModelMetricClaimedVerifiedStatus
1U-NetLog-Spectral Distance3.4Unverified
2U-Net + TFiLMLog-Spectral Distance2Unverified
3U-Net + AFiLMLog-Spectral Distance1.5Unverified
#ModelMetricClaimedVerifiedStatus
1U-NetLog-Spectral Distance3.2Unverified
2U-Net + TFiLMLog-Spectral Distance2.5Unverified
3U-Net + AFiLMLog-Spectral Distance2.3Unverified
#ModelMetricClaimedVerifiedStatus
1U-Net and ResNetSNR35.26Unverified