SOTAVerified

Automatic Speech Recognition

Papers

Showing 2151-2200 of 3174 papers

Title | Status | Hype
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation | | 0
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning | Code | 1
EdgeCRNN: an edgecomputing oriented model of acoustic feature enhancement for keyword spotting | | 0
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition | | 0
A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training | | 0
Learning Word-Level Confidence For Subword End-to-End ASR | | 0
Best of Both Worlds: Robust Accented Speech Recognition with Adversarial Transfer Learning | | 0
Contrastive Semi-supervised Learning for ASR | | 0
A Parallelizable Lattice Rescoring Strategy with Neural Language Models | Code | 3
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios | | 0
WaveGuard: Understanding and Mitigating Audio Adversarial Examples | Code | 1
Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration | | 0
Incorporating VAD into ASR System by Multi-task Learning | | 0
Brain Signals to Rescue Aphasia, Apraxia and Dysarthria Speech Recognition | | 0
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition | | 0
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization | Code | 0
Meta-Learning for improving rare word recognition in end-to-end ASR | | 0
MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition | | 0
Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks | | 0
Thoughts on the potential to compensate a hearing loss in noise | | 0
Evolutionary optimization of contexts for phonetic correction in speech recognition systems | | 0
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain | Code | 0
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model | | 0
Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition | | 0
Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-based Speech Recognition | | 0
Echo State Speech Recognition | | 0
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems | | 0
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition | | 0
End-to-End Automatic Speech Recognition with Deep Mutual Learning | | 0
Improving speech recognition models with small samples for air traffic control systems | | 0
Hierarchical Transformer-based Large-Context End-to-end ASR with Large-Context Knowledge Distillation | | 0
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition | | 0
Multimodal Punctuation Prediction with Contextual Dropout | | 0
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding | | 0
Hybrid phonetic-neural model for correction in speech recognition systems | Code | 0
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR | | 0
Transformer Language Models with LSTM-based Cross-utterance Information Representation | Code | 1
Content-Aware Speaker Embeddings for Speaker Diarisation | | 0
An Investigation of End-to-End Models for Robust Speech Recognition | Code | 1
NUVA: A Naming Utterance Verifier for Aphasia Treatment | | 0
Dompteur: Taming Audio Adversarial Examples | Code | 1
Sparsification via Compressed Sensing for Automatic Speech Recognition | | 0
Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers | | 0
BembaSpeech: A Speech Recognition Corpus for the Bemba Language | Code | 1
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages | Code | 0
A bandit approach to curriculum generation for automatic speech recognition | | 0
Multi-Task Self-Supervised Pre-Training for Music Classification | | 0
Intermediate Loss Regularization for CTC-based Speech Recognition | | 0
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR | | 0
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords | Code | 0
Page 44 of 64

No leaderboard results yet.