| A Survey on Speech Large Language Models | Oct 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | Oct 24, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| STTATTS: Unified Speech-To-Text And Text-To-Speech Model | Oct 24, 2024 | Multi-Task Learningspeech-recognition | CodeCode Available | 1 |
| Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | Oct 18, 2024 | Speech-to-Text | —Unverified | 0 |
| Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck | Oct 15, 2024 | Speech-to-Text | —Unverified | 0 |
| Denial-of-Service Poisoning Attacks against Large Language Models | Oct 14, 2024 | 16kSpeech-to-Text | CodeCode Available | 1 |
| Unsupervised Data Validation Methods for Efficient Model Training | Oct 10, 2024 | Data Augmentationmodel | —Unverified | 0 |
| Transducer Consistency Regularization for Speech to Text Applications | Oct 9, 2024 | Model OptimizationSpeech-to-Text | —Unverified | 0 |
| Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems | Oct 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unveiling the Role of Pretraining in Direct Speech Translation | Sep 26, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |