SOTAVerified

Automatic Speech Recognition

Papers

Showing 20012025 of 3174 papers

TitleStatusHype
Audio-Visual Speech Recognition is Worth 32328 Voxels0
iRNN: Integer-only Recurrent Neural Network0
MeetDot: Videoconferencing with Live Translation Captions0
Model-Based Approach for Measuring the Fairness in ASR0
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding0
Utterance-level neural confidence measure for end-to-end children speech recognition0
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription0
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning0
Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning0
Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition0
Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech0
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search DegradationCode0
Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages0
Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition0
Remember the context! ASR slot error correction through memorization0
Coarse-To-Fine And Cross-Lingual ASR Transfer0
Robustness of end-to-end Automatic Speech Recognition Models – A Case Study using Mozilla DeepSpeech0
Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition0
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding0
Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech0
Task-aware Warping Factors in Mask-based Speech Enhancement0
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition0
Improving callsign recognition with air-surveillance data in air-traffic communication0
4-bit Quantization of LSTM-based Speech Recognition Models0
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR0
Show:102550
← PrevPage 81 of 127Next →

No leaderboard results yet.