| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Sep 13, 2024 | In-Context LearningRetrieval | CodeCode Available | 0 |
| Evaluation of real-time transcriptions using end-to-end ASR models | Sep 9, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| LAST: Language Model Aware Speech Tokenization | Sep 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AI-Based IVR | Aug 20, 2024 | Speech SynthesisSpeech-to-Text | —Unverified | 0 |
| CMU's IWSLT 2024 Simultaneous Speech Translation System | Aug 14, 2024 | DecoderSpeech-to-Text | —Unverified | 0 |
| CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units | Jul 19, 2024 | Machine TranslationSpeech-to-Text | CodeCode Available | 0 |
| AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments | Jul 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Voice Command Pipelines for Drone Control: From STT and LLM to Direct Classification and Siamese Networks | Jul 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models | Jul 9, 2024 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |
| Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation | Jul 4, 2024 | Machine Translationspeech-recognition | —Unverified | 0 |