SOTAVerified

Automatic Speech Recognition

Papers

Showing 2151–2200 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| LV-CTC: Non-autoregressive ASR with CTC and latent variable models | | 0 |
| Lyrics-to-Audio Alignment by Unsupervised Discovery of Repetitive Patterns in Vowel Acoustics | | 0 |
| Machine Speech Chain with One-shot Speaker Adaptation | | 0 |
| MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition | | 0 |
| Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0 | | 0 |
| Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian | | 0 |
| Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation | | 0 |
| Malayalam Speech Corpus: Design and Development for Dravidian Language | | 0 |
| MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation | | 0 |
| Mandarin-English Code-Switching Speech Recognition System for Specific Domain | | 0 |
| ManWav: The First Manchu ASR Model | | 0 |
| Masked Audio Text Encoders are Effective Multi-Modal Rescorers | | 0 |
| Mask scalar prediction for improving robust automatic speech recognition | | 0 |
| MASRI-HEADSET: A Maltese Corpus for Speech Recognition | | 0 |
| Massive End-to-end Models for Short Search Queries | | 0 |
| Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters | | 0 |
| Massively Multilingual Shallow Fusion with Large Language Models | | 0 |
| Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning | | 0 |
| MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability | | 0 |
| Maximum a Posteriori Adaptation of Network Parameters in Deep Models | | 0 |
| M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | | 0 |
| Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training | | 0 |
| MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues | | 0 |
| MeetDot: Videoconferencing with Live Translation Captions | | 0 |
| Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems | | 0 |
| Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR | | 0 |
| Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition | | 0 |
| Memory-Efficient Training of RNN-Transducer with Sampled Softmax | | 0 |
| Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition | | 0 |
| Mesures linguistiques automatiques pour l’évaluation des systèmes de Reconnaissance Automatique de la Parole (Automated linguistic measures for automatic speech recognition systems’ evaluation) | | 0 |
| Meta Auxiliary Learning for Low-resource Spoken Language Understanding | | 0 |
| META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR | | 0 |
| Meta Learning for End-to-End Low-Resource Speech Recognition | | 0 |
| Meta-Learning for improving rare word recognition in end-to-end ASR | | 0 |
| SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition | | 0 |
| MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction | | 0 |
| Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge | | 0 |
| MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation | | 0 |
| Minimally Supervised Written-to-Spoken Text Normalization | | 0 |
| Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition | | 0 |
| Mitigating Noisy Inputs for Question Answering | | 0 |
| MIXPGD: Hybrid Adversarial Training for Speech Recognition Systems | | 0 |
| MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition | | 0 |
| Mixture Encoder for Joint Speech Separation and Recognition | | 0 |
| Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription | | 0 |
| Mixture-of-Expert Conformer for Streaming Multilingual ASR | | 0 |
| Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition | | 0 |
| Mixtures of Deep Neural Experts for Automated Speech Scoring | | 0 |
| MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition | | 0 |
| ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | | 0 |

No leaderboard results yet.