| Enhancing Unsupervised Speech Recognition with Diffusion GANs | Mar 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition | Mar 23, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition | Mar 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-supervised Learning with Speech Modulation Dropout | Mar 22, 2023 | Automatic Speech Recognition, Self-Supervised Learning | Unverified | 0 |
| Transformers in Speech Processing: A Survey | Mar 21, 2023 | Automatic Speech Recognition, Speech Enhancement | Unverified | 0 |
| End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations | Mar 21, 2023 | Action Detection, Activity Detection | Unverified | 0 |
| Code-Switching Text Generation and Injection in Mandarin-English ASR | Mar 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition | Mar 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Deep Learning System for Domain-specific Speech Recognition | Mar 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Visual Information Matters for ASR Error Correction | Mar 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model | Mar 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Trustera: A Live Conversation Redaction System | Mar 16, 2023 | Automatic Speech Recognition, Natural Language Understanding | Unverified | 0 |
| HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism | Mar 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences | Mar 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Improving Accented Speech Recognition with Multi-Domain Training | Mar 14, 2023 | Accented Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| Improving the Intent Classification accuracy in Noisy Environment | Mar 12, 2023 | Automatic Speech Recognition, Classification | Unverified | 0 |
| Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study | Mar 12, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Transcription free filler word detection with Neural semi-CRFs | Mar 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Mar 11, 2023 | Automatic Speech Recognition, image-classification | Code Available | 2 |
| MIXPGD: Hybrid Adversarial Training for Speech Recognition Systems | Mar 10, 2023 | Adversarial Attack, Automatic Speech Recognition | Unverified | 0 |
| Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings | Mar 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts | Mar 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-End Speech Recognition: A Survey | Mar 3, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | Mar 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Leveraging Large Text Corpora for End-to-End Speech Summarization | Mar 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |