| Accented Speech Recognition With Accent-specific Codebooks | Oct 24, 2023 | Accented Speech Recognition, Automatic Speech Recognition | Code Available | 1 |
| Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Oct 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| HowToCaption: Prompting LLMs to Transform Video Annotations at Scale | Oct 7, 2023 | Automatic Speech Recognition, Video Captioning | Code Available | 1 |
| Speech collage: code-switched audio generation by collaging monolingual corpora | Sep 27, 2023 | Audio Generation, Automatic Speech Recognition | Code Available | 1 |
| HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models | Sep 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Memory-augmented conformer for improved end-to-end long-form ASR | Sep 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| HypR: A comprehensive study for ASR hypothesis revising with a reference corpus | Sep 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Unimodal Aggregation for CTC-based Speech Recognition | Sep 15, 2023 | Automatic Speech Recognition, Decoder | Code Available | 1 |
| DiaCorrect: Error Correction Back-end For Speaker Diarization | Sep 15, 2023 | Automatic Speech Recognition, Decoder | Code Available | 1 |
| EnCodecMAE: Leveraging neural codecs for universal audio representation learning | Sep 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |