SOTAVerified

Automatic Speech Recognition

Papers

Showing 2151-2200 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Personalized Keyphrase Detection using Speaker and Environment Information | | 0 |
| Head-synchronous Decoding for Transformer-based Streaming ASR | | 0 |
| Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction | | 0 |
| Semantic Data Augmentation for End-to-End Mandarin Speech Recognition | | 0 |
| Quantization of Deep Neural Networks for Accurate Edge Computing | | 0 |
| Bridging the gap between streaming and non-streaming ASR systems by distilling ensembles of CTC and RNN-T models | | 0 |
| Disfluency Detection with Unlabeled Data and Small BERT Models | | 0 |
| On Sampling-Based Training Criteria for Neural Language Modeling | | 0 |
| Accented Speech Recognition: A Survey | | 0 |
| Scene-aware Far-field Automatic Speech Recognition | | 0 |
| Discriminative Self-training for Punctuation Prediction | | 0 |
| Pre-training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning | | 0 |
| Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers | | 0 |
| On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era | | 0 |
| Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition | | 0 |
| Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers | | 0 |
| Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems | | 0 |
| MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation | | 0 |
| A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It | Code | 0 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Code | 0 |
| Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching | Code | 0 |
| Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept | | 0 |
| Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation | | 0 |
| Bridging the Gap Between Clean Data Training and Real-World Inference for Spoken Language Understanding | | 0 |
| Improved Conformer-based End-to-End Speech Recognition Using Neural Architecture Search | | 0 |
| Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures | | 0 |
| Innovative Bert-based Reranking Language Models for Speech Recognition | | 0 |
| NeMo Inverse Text Normalization: From Development To Production | Code | 0 |
| Non-autoregressive Transformer-based End-to-end ASR using BERT | | 0 |
| On Architectures and Training for Raw Waveform Feature Extraction in ASR | | 0 |
| Accented Speech Recognition Inspired by Human Perception | | 0 |
| WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition | | 0 |
| Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation | | 0 |
| BSTC: A Large-Scale Chinese-English Speech Translation Dataset | | 0 |
| Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems | | 0 |
| Capturing Multi-Resolution Context by Dilated Self-Attention | | 0 |
| Pushing the Limits of Non-Autoregressive Speech Recognition | | 0 |
| Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models | | 0 |
| Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios | | 0 |
| LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring | Code | 0 |
| Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model | | 0 |
| Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions | | 0 |
| Dissecting User-Perceived Latency of On-Device E2E Speech Recognition | | 0 |
| End-to-End Speaker-Attributed ASR with Transformer | | 0 |
| Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition | | 0 |
| Speaker conditioned acoustic modeling for multi-speaker conversational ASR | | 0 |
| Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval | | 0 |
| Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding | | 0 |
| Towards Lifelong Learning of End-to-end ASR | | 0 |
| On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR | Code | 0 |

No leaderboard results yet.