SOTAVerified

Automatic Speech Recognition

Papers

Showing 20012050 of 3174 papers

TitleStatusHype
Audio-Visual Speech Recognition is Worth 32328 Voxels0
iRNN: Integer-only Recurrent Neural Network0
MeetDot: Videoconferencing with Live Translation Captions0
Model-Based Approach for Measuring the Fairness in ASR0
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding0
Utterance-level neural confidence measure for end-to-end children speech recognition0
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription0
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning0
Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning0
Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition0
Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech0
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search DegradationCode0
Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages0
Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition0
Remember the context! ASR slot error correction through memorization0
Coarse-To-Fine And Cross-Lingual ASR Transfer0
Robustness of end-to-end Automatic Speech Recognition Models – A Case Study using Mozilla DeepSpeech0
Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition0
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding0
Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech0
Task-aware Warping Factors in Mask-based Speech Enhancement0
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition0
Improving callsign recognition with air-surveillance data in air-traffic communication0
4-bit Quantization of LSTM-based Speech Recognition Models0
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR0
Reducing Exposure Bias in Training Recurrent Neural Network Transducers0
Automatic Speech Recognition And Limited Vocabulary: A Survey0
A Unified Transformer-based Framework for Duplex Text Normalization0
Hierarchical Summarization for Longform Spoken Dialog0
A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition0
A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems0
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition0
End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive EnvelopesCode0
The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation0
Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation0
Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification0
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation0
Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features0
Learning a Neural Diff for Speech Models0
Amortized Neural Networks for Low-Latency Speech Recognition0
Decoupling recognition and transcription in Mandarin ASR0
Automatic recognition of suprasegmentals in speech0
ZJU’s IWSLT 2021 Speech Translation System0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
How Might We Create Better Benchmarks for Speech Recognition?0
IMS’ Systems for the IWSLT 2021 Low-Resource Speech Translation Task0
Interactive Reinforcement Learning for Table Balancing Robot0
On Knowledge Distillation for Translating Erroneous Speech Transcriptions0
ON-TRAC’ systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks0
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus0
Show:102550
← PrevPage 41 of 64Next →

No leaderboard results yet.