Home/Audio & Speech

Audio & Speech

Papers in this area

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 10 papers

Title	Date	Tasks	Status
Hear Your Code Fail, Voice-Assisted Debugging for Python	Jul 20, 2025	CPUMedical Diagnosis	—Unverified
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech	Jul 17, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
MUPAX: Multidimensional Problem Agnostic eXplainable AI	Jul 17, 2025	Anatomical Landmark DetectionAudio Classification	—Unverified
Autoregressive Speech Enhancement via Acoustic Tokens	Jul 17, 2025	Speech Enhancement	—Unverified
SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks	Jul 17, 2025	DeepFake DetectionFace Swapping	—Unverified
Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine	Jul 17, 2025	Audio ClassificationAutomatic Speech Recognition	—Unverified
P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge	Jul 15, 2025	Speech Enhancementtext-to-speech	—Unverified
Pronunciation Deviation Analysis Through Voice Cloning and Acoustic Comparison	Jul 15, 2025	Voice Cloning	—Unverified
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models	Jul 15, 2025	Audio Source Separationblind source separation	CodeCode Available
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments	Jul 14, 2025	Speech-to-Texttext-to-speech	—Unverified

Show:10 25 50

Task	Papers	Results
Keyword Spotting CSS	1	0
Music Texture Transfer Texture is the collective temporal homogeneity of acoustic e…	1	0
Nepali Speech Recognition	1	0
Parkinson Detection from Speech Detecting Parkinson’s Disease using speech features.	1	0
Piano Music Modeling	1	0
Pronunciation Dictionary Creation Create a pronunciation dictionary	1	0
Real-time Directional Hearing Directional hearing models that also support real-time on-de…	1	0
Recognizing Seven Different Dastgahs Of Iranian Classical Music	1	0
Self-Supervised Sound Classification	1	0
Semi-supervised Audio Classification	1	0
Semi-Supervised Audio Regression	1	0
Shadow Confidence Maps In Ultrasound Imaging	1	0
Speaker Orientation Direction of Voice or speaker orientation of the person with…	1	0
Speech Intelligibility Evaluation In this task, the intelligibility of speech is evaluated aft…	1	0
Speech Synthesis - Assamese	1	0
Speech Synthesis - Bengali	1	0
Speech Synthesis - Bodo	1	0
Speech Synthesis - Hindi	1	0
Speech Synthesis - Kannada	1	0
Speech Synthesis - Malayalam	1	0
Speech Synthesis - Manipuri	1	0
Speech Synthesis - Marathi	1	0
Speech Synthesis - Rajasthani	1	0
Speech Synthesis - Tamil	1	0
Speech Synthesis - Telugu	1	0
Speech to Facial Landmark	1	0
Speech-to-Gesture Translation	1	0
Speech-to-Phoneme	1	0
Spoken Digits Recognition	1	0
Streaming Target Sound Extraction This task is a variant of the [Target Sound Extraction](http…	1	0
Unsupervised Anomaly Detection In Sound	1	0
Unsupervised Few-Shot Audio Classification In few-shot unsupervised classification, we assume that at t…	1	0
Video/Text-to-Audio Generation	1	0
Zero-Shot Video-Audio Retrieval	1	0
Detection Of Instrumentals Musical Tracks	0	0
Hearing Aid and device processing	0	0
Pronunciation Assessment	0	0
Single-Label Target Sound Extraction Single-Label Target Sound Extraction is the task of extracti…	0	0
Speaking Style Synthesis	0	0