| Convoifilter: A case study of doing cocktail party speech recognition | Aug 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| SeamlessM4T: Massively Multilingual & Multimodal Machine Translation | Aug 22, 2023 | Automatic Speech Recognition, Machine Translation | Code Available | 2 |
| TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition | Aug 21, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Indonesian Automatic Speech Recognition with XLSR-53 | Aug 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | Aug 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals | Aug 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Accurate synthesis of Dysarthric Speech for ASR data augmentation | Aug 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations | Aug 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Improving CTC-AED model with integrated-CTC and auxiliary loss regularization | Aug 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Using Text Injection to Improve Recognition of Personal Identifiers in Speech | Aug 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Text Injection for Capitalization and Turn-Taking Prediction in Speech Models | Aug 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder | Aug 14, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 1 |
| Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action Detection, Activity Detection | Code Available | 0 |
| Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | Aug 12, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | Aug 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Novel Self-training Approach for Low-resource Speech Recognition | Aug 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Aug 9, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| Comparative Analysis of the wav2vec 2.0 Feature Extractor | Aug 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Aug 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging | Aug 5, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Federated Representation Learning for Automatic Speech Recognition | Aug 3, 2023 | Automatic Speech Recognition, Federated Learning | Unverified | 0 |
| Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification | Aug 2, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | Jul 30, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Jul 29, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |