SOTAVerified

Automatic Speech Recognition

Papers

Showing 16011625 of 3174 papers

TitleStatusHype
An Analysis of Semantically-Aligned Speech-Text Embeddings0
End-to-end model for named entity recognition from speech without paired training data0
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation0
PriMock57: A Dataset Of Primary Care Mock ConsultationsCode1
Alternate Intermediate Conditioning with Syllable-level and Character-level Targets for Japanese ASR0
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation0
End-to-End Multi-speaker ASR with Independent Vector Analysis0
Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition0
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge0
Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives0
indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languagesCode1
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control CommunicationsCode1
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset0
Memory-Efficient Training of RNN-Transducer with Sampled Softmax0
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings0
Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition0
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data0
Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition0
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionCode1
Improving Speech Recognition for Indic Languages using Language Model0
Streaming Speaker-Attributed ASR with Token-Level Speaker EmbeddingsCode1
Code Switched and Code Mixed Speech Recognition for Indic languages0
Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech RecognitionCode0
Show:102550
← PrevPage 65 of 127Next →

No leaderboard results yet.