SOTAVerified

Automatic Speech Recognition

Papers

Showing 1951–2000 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| On Knowledge Distillation for Translating Erroneous Speech Transcriptions | | 0 |
| ON-TRAC’ systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks | | 0 |
| Without Further Ado: Direct and Simultaneous Speech Translation by AppTek in 2021 | | 0 |
| How Might We Create Better Benchmarks for Speech Recognition? | | 0 |
| Technology-Augmented Multilingual Communication Models: New Interaction Paradigms, Shifts in the Language Services Industry, and Implications for Training Programs | | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | | 0 |
| QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | | 0 |
| USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments | Code | 1 |
| The History of Speech Recognition to the Year 2030 | Code | 1 |
| Can You Hear It? Backdoor Attacks via Ultrasonic Triggers | | 0 |
| Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition | | 0 |
| An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning | | 0 |
| Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations | | 0 |
| Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 | Code | 1 |
| OLR 2021 Challenge: Datasets, Rules and Baselines | | 0 |
| CarneliNet: Neural Mixture Model for Automatic Speech Recognition | | 0 |
| Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech | | 0 |
| On Prosody Modeling for ASR+TTS based Voice Conversion | | 0 |
| Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models | | 0 |
| Sequence Model with Self-Adaptive Sliding Window for Efficient Spoken Document Segmentation | | 0 |
| A baseline model for computationally inexpensive speech recognition for Kazakh using the Coqui STT framework | | 0 |
| Token-Level Supervised Contrastive Learning for Punctuation Restoration | Code | 1 |
| STRODE: Stochastic Boundary Ordinary Differential Equation | Code | 1 |
| A Comparison of Methods for OOV-word Recognition on a New Public Dataset | Code | 1 |
| VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording | | 0 |
| Multi-task Learning with Cross Attention for Keyword Spotting | | 0 |
| The IWSLT 2021 BUT Speech Translation Systems | | 0 |
| A Configurable Multilingual Model is All You Need to Recognize All Languages | | 0 |
| Zero-shot Speech Translation | | 0 |
| Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems | | 0 |
| Layer-wise Analysis of a Self-supervised Speech Representation Model | Code | 1 |
| Loss Prediction: End-to-End Active Learning Approach For Speech Recognition | | 0 |
| On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models | | 0 |
| Noisy Training Improves E2E ASR for the Edge | | 0 |
| Improved Language Identification Through Cross-Lingual Self-Supervised Learning | | 0 |
| End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning | | 0 |
| Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers | | 0 |
| A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio | | 0 |
| Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition | Code | 0 |
| Investigation of Practical Aspects of Single Channel Speech Separation for ASR | | 0 |
| Arabic Code-Switching Speech Recognition using Monolingual Data | | 0 |
| Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation | | 0 |
| TENET: A Time-reversal Enhancement Network for Noise-robust ASR | Code | 1 |
| Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition | | 0 |
| Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition | Code | 1 |
| Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition | | 0 |
| Multi-user VoiceFilter-Lite via Attentive Speaker Embedding | | 0 |
| SmarTerp: A CAI System to Support Simultaneous Interpreters in Real-Time | | 0 |
| Improving Named Entity Recognition in Spoken Dialog Systems by Context and Speech Pattern Modeling | | 0 |
| StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR | | 0 |
