SOTAVerified

Automatic Speech Recognition

Papers

Showing 29513000 of 3174 papers

TitleStatusHype
Semantically Meaningful Metrics for Norwegian ASR SystemsCode0
BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech RecognitionCode0
BehancePR: A Punctuation Restoration Dataset for Livestreaming Video TranscriptCode0
Enhancing Quantised End-to-End ASR Models via PersonalisationCode0
Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition modelsCode0
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech RecognitionCode0
HydraFormer: One Encoder For All Subsampling RatesCode0
Semantic Mask for Transformer based End-to-End Speech RecognitionCode0
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-trainingCode0
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation NetworkCode0
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential DataCode0
Hybrid phonetic-neural model for correction in speech recognition systemsCode0
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanismCode0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor ContractionsCode0
A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC VideosCode0
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and SubtitlingCode0
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distributionCode0
NeMo Inverse Text Normalization: From Development To ProductionCode0
End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive EnvelopesCode0
Attention-based Multi-hypothesis Fusion for Speech SummarizationCode0
DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European LanguagesCode0
Neural Architecture Search For LF-MMI Trained Time Delay Neural NetworksCode0
Speech Translation Refinement using Large Language ModelsCode0
BERT Attends the Conversation: Improving Low-Resource Conversational ASRCode0
Direct Segmentation Models for Streaming Speech TranslationCode0
Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning FusionCode0
Did you hear that? Adversarial Examples Against Automatic Speech RecognitionCode0
Training dynamic models using early exits for automatic speech recognition on resource-constrained devicesCode0
DiaCorrect: End-to-end error correction for speaker diarizationCode0
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-trainingCode0
Neural NILM: Deep Neural Networks Applied to Energy DisaggregationCode0
End-to-End Open Vocabulary Keyword Search With Multilingual Neural RepresentationsCode0
Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-LabelingCode0
TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASRCode0
Word-level Embeddings for Cross-Task Transfer Learning in Speech ProcessingCode0
Unsupervised Online Continual Learning for Automatic Speech RecognitionCode0
Human Transcription Quality ImprovementCode0
Light Gated Recurrent Units for Speech RecognitionCode0
HuBERT-EE: Early Exiting HuBERT for Efficient Speech RecognitionCode0
How You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia DetectionCode0
Quantifying Bias in Automatic Speech RecognitionCode0
To Distill or Not to Distill? On the Robustness of Robust Knowledge DistillationCode0
Attentional Speech Recognition Models Misbehave on Out-of-domain UtterancesCode0
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech RecognitionCode0
Linear Time Complexity Conformers with SummaryMixing for Streaming Speech RecognitionCode0
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of WolofCode0
Quaternion Convolutional Neural Networks for End-to-End Automatic Speech RecognitionCode0
Text-Based Detection of On-Hold Scripts in Contact Center CallsCode0
Quaternion Recurrent Neural NetworksCode0
How Phonotactics Affect Multilingual and Zero-shot ASR PerformanceCode0
Show:102550
← PrevPage 60 of 64Next →

No leaderboard results yet.