SOTAVerified

Automatic Speech Recognition

Papers

Showing 651-675 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition | | 0 |
| ASR Error Correction using Large Language Models | | 0 |
| Exploring SSL Discrete Tokens for Multilingual ASR | | 0 |
| CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments | | 0 |
| Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech | | 0 |
| LA-RAG: Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation | | 0 |
| Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages | | 0 |
| NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training | | 0 |
| Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? | | 0 |
| Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models | Code | 0 |
| The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language | | 0 |
| Full-text Error Correction for Chinese Speech Recognition with Large Language Model | | 0 |
| Enhancing CTC-Based Visual Speech Recognition | | 0 |
| Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition | Code | 0 |
| An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition | | 0 |
| Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking | | 0 |
| Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings | Code | 0 |
| NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge | | 0 |
| Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge | | 0 |
| A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR | | 0 |
| Evaluation of real-time transcriptions using end-to-end ASR models | | 0 |
| Retrieval Augmented Correction of Named Entity Speech Recognition Errors | | 0 |
| An investigation of modularity for noise robustness in conformer-based ASR | | 0 |
| Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection | | 0 |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | | 0 |

No leaderboard results yet.