SOTAVerified

Audio Signal Processing

This is a general task covering the transformation of audio inputs into audio outputs. It is not limited to the existing Papers with Code categories such as Source Separation, Denoising, Classification, and Recognition.

Papers

Showing 1-50 of 70 papers

| Title | Status | Hype |
| --- | --- | --- |
| High Fidelity Neural Audio Compression | Code | 4 |
| TorchFX: A modern approach to Audio DSP with PyTorch and GPU acceleration | Code | 2 |
| A Survey on Data Augmentation in Large Model Era | Code | 2 |
| Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope | Code | 1 |
| Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation | Code | 1 |
| HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids | Code | 1 |
| Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport | Code | 1 |
| MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion | Code | 1 |
| Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation | Code | 1 |
| Differentiable Signal Processing With Black-Box Audio Effects | Code | 1 |
| DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing from Decentralised Data | Code | 1 |
| L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing | Code | 1 |
| Upsampling artifacts in neural audio synthesis | Code | 1 |
| Exploring Quality and Generalizability in Parameterized Neural Audio Effects | Code | 1 |
| SignalTrain: Profiling Audio Compressors with Deep Neural Networks | Code | 1 |
| DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models | | 0 |
| Sound Field Estimation: Theories and Applications | | 0 |
| Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement | | 0 |
| Comparative Analysis of Mel-Frequency Cepstral Coefficients and Wavelet Based Audio Signal Processing for Emotion Detection and Mental Health Assessment in Spoken Speech | | 0 |
| Detecting abnormal heart sound using mobile phones and on-device IConNet | | 0 |
| Blind Localization of Early Room Reflections with Arbitrary Microphone Array | | 0 |
| Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments | Code | 0 |
| Classification of Heart Sounds Using Multi-Branch Deep Convolutional Network and LSTM-CNN | | 0 |
| AudioSetMix: Enhancing Audio-Language Datasets with LLM-Assisted Augmentations | | 0 |
| Comparative Study of State-based Neural Networks for Virtual Analog Audio Effects Modeling | | 0 |
| Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality | | 0 |
| Study of speaker localization under dynamic and reverberant environments | | 0 |
| HPCNeuroNet: Advancing Neuromorphic Audio Signal Processing with Transformer-Enhanced Spiking Neural Networks | | 0 |
| Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing | | 0 |
| Speaker localization using direct path dominance test based on sound field directivity | | 0 |
| Audio signal based danger detection using signal processing and deep learning | Code | 0 |
| Instabilities in Convnets for Raw Audio | Code | 0 |
| Neural Architectures Learning Fourier Transforms, Signal Processing and Much More.... | | 0 |
| Compositional nonlinear audio signal processing with Volterra series | | 0 |
| Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration | | 0 |
| Subspace-Configurable Networks | Code | 0 |
| Novel features for the detection of bearing faults in railway vehicles | | 0 |
| Human Behavior in the Time of COVID-19: Learning from Big Data | | 0 |
| Content Adaptive Front End For Audio Classification | | 0 |
| MYRiAD: A Multi-Array Room Acoustic Database | Code | 0 |
| A Comparison of Audio Preprocessing Techniques and Deep Learning Algorithms for Raga Recognition | | 0 |
| A Unifying View on Blind Source Separation of Convolutive Mixtures based on Independent Component Analysis | | 0 |
| Context-sensitive neocortical neurons transform the effectiveness and efficiency of neural information processing | | 0 |
| Declipping of Speech Signals Using Frequency Selective Extrapolation | | 0 |
| Bi-Sampling Approach to Classify Music Mood leveraging Raga-Rasa Association in Indian Classical Music | | 0 |
| Manifold learning-supported estimation of relative transfer functions for spatial filtering | | 0 |
| Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement | | 0 |
| Blind Identification of State-Space Models in Physical Coordinates | | 0 |
| BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data | | 0 |
| Visualization of Linear Operations in the Spherical Harmonics Domain | Code | 0 |

No leaderboard results yet.