| Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Feb 21, 2022 | Data AugmentationPhoneme Recognition | CodeCode Available | 0 |
| Punctuation restoration in Swedish through fine-tuned KB-BERT | Feb 14, 2022 | Language ModellingPunctuation Restoration | —Unverified | 0 |
| Semantic-aware Speech to Text Transmission with Redundancy Removal | Feb 7, 2022 | Semantic CommunicationSpeech-to-Text | —Unverified | 0 |
| Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility | Feb 5, 2022 | Speech EnhancementSpeech-to-Text | —Unverified | 0 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | SentenceSpeech-to-Speech Translation | CodeCode Available | 2 |
| A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture | Jan 6, 2022 | Speech-to-Texttext-to-speech | CodeCode Available | 0 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Dec 23, 2021 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement | Dec 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Cross-modal Contrastive Learning for Speech Translation | Dec 17, 2021 | Contrastive LearningRetrieval | —Unverified | 0 |
| X-Vector based voice activity detection for multi-genre broadcast speech-to-text | Dec 9, 2021 | Action DetectionActivity Detection | CodeCode Available | 1 |