| Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR | Feb 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Feb 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | Feb 20, 2024 | Automatic Speech Recognition, image-classification | Unverified | 0 |
| Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru | Feb 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models | Feb 14, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| An Embarrassingly Simple Approach for LLM with Strong ASR Capacity | Feb 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Feb 12, 2024 | 2k, Automatic Speech Recognition | Code Available | 2 |
| The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese | Feb 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models | Feb 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-consistent context aware conformer transducer for speech recognition | Feb 9, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |