| UFO2: A unified pre-training framework for online and offline speech recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Utilization of Large Pre-Trained Models for Low Resource ASR | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Monotonic segmental attention for automatic speech recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition | Oct 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Does Joint Training Really Help Cascaded Speech Translation? | Oct 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Time-Domain Speech Enhancement for Robust Automatic Speech Recognition | Oct 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Oct 24, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |