| Using Text Injection to Improve Recognition of Personal Identifiers in Speech | Aug 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder | Aug 14, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 1 |
| Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action DetectionActivity Detection | CodeCode Available | 0 |
| Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | Aug 12, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | Aug 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Novel Self-training Approach for Low-resource Speech Recognition | Aug 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Aug 9, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Comparative Analysis of the wav2vec 2.0 Feature Extractor | Aug 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Aug 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |