SOTAVerified

Automatic Speech Recognition

Papers

Showing 24512500 of 3174 papers

TitleStatusHype
Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems0
Unveiling the Role of Pretraining in Direct Speech Translation0
Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models0
Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification?0
Use of Knowledge Graph in Rescoring the N-Best List in Automatic Speech Recognition0
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis0
Using Automatic Speech Recognition in Spoken Corpus Curation0
Using English Acoustic Models for Hindi Automatic Speech Recognition0
Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition0
Using Kaldi for Automatic Speech Recognition of Conversational Austrian German0
Using Large Language Model for End-to-End Chinese ASR and NER0
Using multiple ASR hypotheses to boost i18n NLU performance0
Using multi-task learning to improve the performance of acoustic-to-word and conventional hybrid models0
Using Spoken Word Posterior Features in Neural Machine Translation0
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems0
Using Text Injection to Improve Recognition of Personal Identifiers in Speech0
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation0
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models0
Utterance-level neural confidence measure for end-to-end children speech recognition0
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones0
V2S attack: building DNN-based voice conversion from automatic speaker verification0
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition0
VAIS ASR: Building a conversational speech recognition system using language model combination0
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages0
VALLR: Visual ASR Language Model for Lip Reading0
ValSub: Subsampling Validation Data to Mitigate Forgetting during ASR Personalization0
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition0
V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization0
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining0
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition0
ViraPart: A Text Refinement Framework for Automatic Speech Recognition and Natural Language Processing Tasks in Persian0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech0
Visual-Aware Speech Recognition for Noisy Scenarios0
Visual Information Matters for ASR Error Correction0
Visualizing Automatic Speech Recognition -- Means for a Better Understanding?0
VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis0
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer0
Voice Privacy with Smart Digital Assistants in Educational Settings0
Voice Quality and Pitch Features in Transformer-Based Speech Recognition0
Voice Query Auto Completion0
VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System0
VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka0
VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment0
Warped Language Models for Noise Robust Language Understanding0
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR0
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning0
wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts0
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR0
Show:102550
← PrevPage 50 of 64Next →

No leaderboard results yet.