SOTAVerified

Automatic Speech Recognition

Papers

Showing 19011950 of 3174 papers

TitleStatusHype
iRNN: Integer-only Recurrent Neural Network0
Audio-Visual Speech Recognition is Worth 32328 Voxels0
MeetDot: Videoconferencing with Live Translation Captions0
Model-Based Approach for Measuring the Fairness in ASR0
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding0
Utterance-level neural confidence measure for end-to-end children speech recognition0
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription0
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning0
Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning0
Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech0
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech RecognitionCode1
Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition0
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search DegradationCode0
Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages0
Remember the context! ASR slot error correction through memorization0
Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition0
Vietnamese end-to-end speech recognition using wav2vec 2.0Code1
Coarse-To-Fine And Cross-Lingual ASR Transfer0
Robustness of end-to-end Automatic Speech Recognition Models – A Case Study using Mozilla DeepSpeech0
Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition0
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionCode1
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding0
Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech0
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition0
4-bit Quantization of LSTM-based Speech Recognition Models0
Improving callsign recognition with air-surveillance data in air-traffic communication0
Task-aware Warping Factors in Mask-based Speech Enhancement0
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR0
Reducing Exposure Bias in Training Recurrent Neural Network Transducers0
Automatic Speech Recognition And Limited Vocabulary: A Survey0
A Unified Transformer-based Framework for Duplex Text Normalization0
Hierarchical Summarization for Longform Spoken Dialog0
A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition0
A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems0
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition0
End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive EnvelopesCode0
The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation0
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent ClassificationCode1
Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features0
Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation0
Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification0
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation0
Amortized Neural Networks for Low-Latency Speech Recognition0
Learning a Neural Diff for Speech Models0
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
Decoupling recognition and transcription in Mandarin ASR0
Automatic recognition of suprasegmentals in speech0
Interactive Reinforcement Learning for Table Balancing Robot0
ZJU’s IWSLT 2021 Speech Translation System0
IMS’ Systems for the IWSLT 2021 Low-Resource Speech Translation Task0
Show:102550
← PrevPage 39 of 64Next →

No leaderboard results yet.