SOTAVerified

Automatic Speech Recognition

Papers

Showing 15511600 of 3174 papers

TitleStatusHype
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses0
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training0
MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues0
MeetDot: Videoconferencing with Live Translation Captions0
Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems0
Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR0
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition0
Memory-Efficient Training of RNN-Transducer with Sampled Softmax0
Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition0
Mesures linguistiques automatiques pour l’évaluation des systèmes de Reconnaissance Automatique de la Parole (Automated linguistic measures for automatic speech recognition systems’ evaluation)0
Meta Auxiliary Learning for Low-resource Spoken Language Understanding0
META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR0
Meta Learning for End-to-End Low-Resource Speech Recognition0
Meta-Learning for improving rare word recognition in end-to-end ASR0
SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition0
MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction0
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge0
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation0
Minimally Supervised Written-to-Spoken Text Normalization0
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition0
Mitigating Noisy Inputs for Question Answering0
MIXPGD: Hybrid Adversarial Training for Speech Recognition Systems0
MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition0
Mixture Encoder for Joint Speech Separation and Recognition0
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription0
Mixture-of-Expert Conformer for Streaming Multilingual ASR0
Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition0
Mixtures of Deep Neural Experts for Automated Speech Scoring0
MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition0
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding0
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition0
MLP-based architecture with variable length input for automatic speech recognition0
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets0
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark0
MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition0
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition0
MobileASR: A resource-aware on-device learning framework for user voice personalization applications on mobile phones0
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features0
Model Adaptation for ASR in low-resource Indian Languages0
Model-Based Approach for Measuring the Fairness in ASR0
Modeling Acoustic-Prosodic Cues for Word Importance Prediction in Spoken Dialogues0
Modeling Confidence in Sequence-to-Sequence Models0
Modeling Dependent Structure for Utterances in ASR Evaluation0
Modeling State-Conditional Observation Distribution using Weighted Stereo Samples for Factorial Speech Processing Models0
Modelling prosodic structure using Artificial Neural Networks0
Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model0
MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition0
Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models0
Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries0
Monolingual Recognizers Fusion for Code-switching Speech Recognition0
Show:102550
← PrevPage 32 of 64Next →

No leaderboard results yet.