SOTAVerified
Home/Audio & Speech

Audio & Speech

239 tasks · View all areas

Papers in this area

Showing 110 of 10 papers

TitleStatusHype
Hear Your Code Fail, Voice-Assisted Debugging for Python0
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech0
MUPAX: Multidimensional Problem Agnostic eXplainable AI0
Autoregressive Speech Enhancement via Acoustic Tokens0
SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks0
Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine0
P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge0
Pronunciation Deviation Analysis Through Voice Cloning and Acoustic Comparison0
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation ModelsCode0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
Show:102550
TaskPapersResults
Keyword Spotting CSS10
Music Texture Transfer

Texture is the collective temporal homogeneity of acoustic e…

10
Nepali Speech Recognition10
Parkinson Detection from Speech

Detecting Parkinson’s Disease using speech features.

10
Piano Music Modeling10
Pronunciation Dictionary Creation

Create a pronunciation dictionary

10
Real-time Directional Hearing

Directional hearing models that also support real-time on-de…

10
Recognizing Seven Different Dastgahs Of Iranian Classical Music10
Self-Supervised Sound Classification10
Semi-supervised Audio Classification10
Semi-Supervised Audio Regression10
Shadow Confidence Maps In Ultrasound Imaging10
Speaker Orientation

Direction of Voice or speaker orientation of the person with…

10
Speech Intelligibility Evaluation

In this task, the intelligibility of speech is evaluated aft…

10
Speech Synthesis - Assamese10
Speech Synthesis - Bengali10
Speech Synthesis - Bodo10
Speech Synthesis - Hindi10
Speech Synthesis - Kannada10
Speech Synthesis - Malayalam10
Speech Synthesis - Manipuri10
Speech Synthesis - Marathi10
Speech Synthesis - Rajasthani10
Speech Synthesis - Tamil10
Speech Synthesis - Telugu10
Speech to Facial Landmark10
Speech-to-Gesture Translation10
Speech-to-Phoneme10
Spoken Digits Recognition10
Streaming Target Sound Extraction

This task is a variant of the [Target Sound Extraction](http…

10
Unsupervised Anomaly Detection In Sound10
Unsupervised Few-Shot Audio Classification

In few-shot unsupervised classification, we assume that at t…

10
Video/Text-to-Audio Generation10
Zero-Shot Video-Audio Retrieval10
Detection Of Instrumentals Musical Tracks00
Hearing Aid and device processing00
Pronunciation Assessment00
Single-Label Target Sound Extraction

Single-Label Target Sound Extraction is the task of extracti…

00
Speaking Style Synthesis00