SOTAVerified

Automatic Speech Recognition

Papers

Showing 2051–2100 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Technology-Augmented Multilingual Communication Models: New Interaction Paradigms, Shifts in the Language Services Industry, and Implications for Training Programs | | 0 |
| Without Further Ado: Direct and Simultaneous Speech Translation by AppTek in 2021 | | 0 |
| Can You Hear It? Backdoor Attacks via Ultrasonic Triggers | | 0 |
| Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition | | 0 |
| An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning | | 0 |
| Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations | | 0 |
| OLR 2021 Challenge: Datasets, Rules and Baselines | | 0 |
| CarneliNet: Neural Mixture Model for Automatic Speech Recognition | | 0 |
| Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech | | 0 |
| On Prosody Modeling for ASR+TTS based Voice Conversion | | 0 |
| Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models | | 0 |
| Sequence Model with Self-Adaptive Sliding Window for Efficient Spoken Document Segmentation | | 0 |
| A baseline model for computationally inexpensive speech recognition for Kazakh using the Coqui STT framework | | 0 |
| Multi-task Learning with Cross Attention for Keyword Spotting | | 0 |
| VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording | | 0 |
| Zero-shot Speech Translation | | 0 |
| The IWSLT 2021 BUT Speech Translation Systems | | 0 |
| A Configurable Multilingual Model is All You Need to Recognize All Languages | | 0 |
| Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems | | 0 |
| Loss Prediction: End-to-End Active Learning Approach For Speech Recognition | | 0 |
| On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models | | 0 |
| Noisy Training Improves E2E ASR for the Edge | | 0 |
| Improved Language Identification Through Cross-Lingual Self-Supervised Learning | | 0 |
| Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers | | 0 |
| End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning | | 0 |
| A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio | | 0 |
| Investigation of Practical Aspects of Single Channel Speech Separation for ASR | | 0 |
| Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition | Code | 0 |
| Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation | | 0 |
| Arabic Code-Switching Speech Recognition using Monolingual Data | | 0 |
| Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition | | 0 |
| Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition | | 0 |
| Multi-user VoiceFilter-Lite via Attentive Speaker Embedding | | 0 |
| SmarTerp: A CAI System to Support Simultaneous Interpreters in Real-Time | | 0 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Code | 0 |
| StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR | | 0 |
| Word-Free Spoken Language Understanding for Mandarin-Chinese | | 0 |
| Improving Named Entity Recognition in Spoken Dialog Systems by Context and Speech Pattern Modeling | | 0 |
| On joint training with interfaces for spoken language understanding | | 0 |
| Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models | | 0 |
| IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task | | 0 |
| Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding | | 0 |
| QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus | | 0 |
| Where are we in semantic concept extraction for Spoken Language Understanding? | | 0 |
| Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens | | 0 |
| Mixtures of Deep Neural Experts for Automated Speech Scoring | | 0 |
| A Discriminative Entity-Aware Language Model for Virtual Assistants | | 0 |
| An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition | | 0 |
| Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System | | 0 |
| On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech | | 0 |
Page 42 of 64

No leaderboard results yet.