SOTAVerified

Automatic Speech Recognition

Papers

Showing 801850 of 3174 papers

TitleStatusHype
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion TechniquesCode0
Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR0
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets0
Reading Miscue Detection in Primary School through Automatic Speech Recognition0
Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter0
AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection0
Tag and correct: high precision post-editing approach to correction of speech recognition errors0
ASTRA: Aligning Speech and Text Representations for Asr without Sampling0
MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations0
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR0
Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis0
To Distill or Not to Distill? On the Robustness of Robust Knowledge DistillationCode0
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend0
Hypernetworks for Personalizing ASR to Atypical Speech0
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores0
Joint Beam Search Integrating CTC, Attention, and Transducer Decoders0
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition0
Enhancing CTC-based speech recognition with diverse modeling units0
Error-preserving Automatic Speech Recognition of Young English Learners' LanguageCode0
Text Injection for Neural Contextual Biasing0
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision0
Keyword-Guided Adaptation of Automatic Speech Recognition0
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping0
Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach0
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning0
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities0
Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation0
Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous ClientsCode0
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language UnderstandingCode0
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation0
Contextualized Automatic Speech Recognition with Dynamic Vocabulary0
You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish0
FairLENS: Assessing Fairness in Law Enforcement Speech Recognition0
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models0
Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings0
Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer0
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants0
SpeechVerse: A Large-scale Generalizable Audio Language Model0
Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech0
Open Implementation and Study of BEST-RQ for Speech Processing0
MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition0
Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition0
Efficient Compression of Multitask Multilingual Speech Models0
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features0
Sequence-to-sequence models in peer-to-peer learning: A practical application0
Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation0
Automatic Speech Recognition System-Independent Word Error Rate Estimation0
U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF0
Developing Acoustic Models for Automatic Speech Recognition in Swedish0
Show:102550
← PrevPage 17 of 64Next →

No leaderboard results yet.