| DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set | Oct 30, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | Oct 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition | Oct 28, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition | Oct 28, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextual-Utterance Training for Automatic Speech Recognition | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Simulating realistic speech overlaps improves multi-talker ASR | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| SAN: a robust end-to-end ASR model architecture | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| There is more than one kind of robustness: Fooling Whisper with adversarial examples | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| UFO2: A unified pre-training framework for online and offline speech recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Utilization of Large Pre-Trained Models for Low Resource ASR | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |