SOTAVerified

Automatic Speech Recognition

Papers

Showing 776–800 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Performant ASR Models for Medical Entities in Accented Speech | | 0 |
| Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Code | 0 |
| Automatic Speech Recognition for Biomedical Data in Bengali Language | | 0 |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | | 0 |
| Large Language Models for Dysfluency Detection in Stuttered Speech | | 0 |
| Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition | | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Code | 0 |
| Learning Language Structures through Grounding | | 0 |
| Optimizing Byte-level Representation for End-to-end ASR | | 0 |
| Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation | | 0 |
| ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR | | 0 |
| An efficient text augmentation approach for contextualized Mandarin speech recognition | | 0 |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Code | 0 |
| Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition | | 0 |
| Multi-Modal Retrieval For Large Language Model Based Speech Recognition | | 0 |
| The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments | | 0 |
| Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment | | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Code | 0 |
| DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion | | 0 |
| Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data | | 0 |
| Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR | | 0 |
| Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques | Code | 0 |
| Towards Unsupervised Speech Recognition Without Pronunciation Models | Code | 0 |
| Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation | Code | 0 |
| Transformer-based Model for ASR N-Best Rescoring and Rewriting | | 0 |
Page 32 of 127

No leaderboard results yet.