| PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System | Sep 28, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR | Sep 28, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models | Sep 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Speech collage: code-switched audio generation by collaging monolingual corpora | Sep 27, 2023 | Audio GenerationAutomatic Speech Recognition | CodeCode Available | 1 |
| Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study | Sep 27, 2023 | Automatic Speech RecognitionKeyword Spotting | —Unverified | 0 |
| Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study | Sep 27, 2023 | Automatic Speech RecognitionSelf-Supervised Learning | —Unverified | 0 |
| Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition | Sep 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project | Sep 26, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Segmentation-Free Streaming Machine Translation | Sep 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference | Sep 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |