SOTAVerified

Automatic Speech Recognition

Papers

Showing 426450 of 3174 papers

TitleStatusHype
Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope0
Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning0
Spatial Audio Processing with Large Language Model on Wearable Devices0
Visual-Aware Speech Recognition for Noisy Scenarios0
DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationCode0
LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect0
Chain of Correction for Full-text Speech Recognition with Large Language Models0
Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition SystemsCode0
The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR0
VALLR: Visual ASR Language Model for Lip Reading0
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications0
Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization0
Whispering in Amharic: Fine-tuning Whisper for Low-resource Language0
Your voice is your voice: Supporting Self-expression through Speech Generation and LLMs in Augmented and Alternative Communication0
Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces0
Halving transcription time: A fast, user-friendly and GDPR-compliant workflow to create AI-assisted transcripts for content analysis0
Enhancing Aviation Communication Transcription: Fine-Tuning Distil-Whisper with LoRA0
ValSub: Subsampling Validation Data to Mitigate Forgetting during ASR Personalization0
Everything Can Be Described in Words: A Simple Unified Multi-Modal Framework with Semantic and Temporal Alignment0
An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR0
Building English ASR model with regional language support0
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling0
From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment0
Qieemo: Speech Is All You Need in the Emotion Recognition in Conversations0
Direct Speech to Speech Translation: A Review0
Show:102550
← PrevPage 18 of 127Next →

No leaderboard results yet.