SOTAVerified

Audio Super-Resolution

Audio super-resolution, especially speech, refers to the process of reconstructing high-resolution music signals from their low-resolution counterparts. Essentially, it enhances the quality of a speech signal by increasing its sampling rate or bandwidth while preserving naturalness and intelligibility. A representative Github project for speech super-resolution is ClearerVoice-Studio.

Papers

Showing 122 of 22 papers

TitleStatusHype
AudioSR: Versatile Audio Super-resolution at ScaleCode3
CMGAN: Conformer-Based Metric-GAN for Monaural Speech EnhancementCode2
AERO: Audio Super Resolution in the Spectral DomainCode2
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space modelsCode2
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling RatesCode2
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow MatchingCode2
TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised PretrainingCode1
Learning Continuous Representation of Audio for Arbitrary Scale Super ResolutionCode1
Neural Vocoder is All You Need for Speech Super-resolutionCode1
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANsCode1
NU-Wave: A Diffusion Probabilistic Model for Neural Audio UpsamplingCode1
On Filter Generalization for Music Bandwidth Extension Using Deep Neural NetworksCode1
Self-Attention for Audio Super-ResolutionCode1
Audio Super Resolution using Neural NetworksCode0
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise ModulationsCode0
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations.Code0
Edge Storage Management Recipe with Zero-Shot Data Compression for Road Anomaly Detection0
FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation0
Gull: A Generative Multifunctional Audio Codec0
Learning to Have an Ear for Face Super-Resolution0
Adversarial Audio Super-Resolution with Unsupervised Feature Losses0
An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution0
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1U-NetLog-Spectral Distance3.1Unverified
2U-Net + TFiLMLog-Spectral Distance1.8Unverified
3U-Net + AFiLMLog-Spectral Distance1.7Unverified
4TUNetLog-Spectral Distance1.36Unverified
5TUNet + MSM pre-trainingLog-Spectral Distance1.28Unverified
6NVSRLog-Spectral Distance0.78Unverified
7CMGANLog-Spectral Distance0.76Unverified
#ModelMetricClaimedVerifiedStatus
1U-NetLog-Spectral Distance3.4Unverified
2U-Net + TFiLMLog-Spectral Distance2Unverified
3U-Net + AFiLMLog-Spectral Distance1.5Unverified
#ModelMetricClaimedVerifiedStatus
1U-NetLog-Spectral Distance3.2Unverified
2U-Net + TFiLMLog-Spectral Distance2.5Unverified
3U-Net + AFiLMLog-Spectral Distance2.3Unverified
#ModelMetricClaimedVerifiedStatus
1U-Net and ResNetSNR35.26Unverified