| Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks | Aug 25, 2022 | Machine Translation, Part-Of-Speech Tagging | Unverified | 0 |
| Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech | Aug 10, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Extending RNN-T-based speech recognition systems with emotion and language classification | Jul 28, 2022 | Emotion Classification, Emotion Recognition | Unverified | 0 |
| RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks | Jul 14, 2022 | Speech-to-Text | Unverified | 0 |
| M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation | Jul 3, 2022 | Decoder, Speech-to-Text | Code Available | 0 |
| Language Model Augmented Monotonic Attention for Simultaneous Translation | Jul 1, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| System Description on Automatic Simultaneous Translation Workshop | Jul 1, 2022 | Sentence, Speech-to-Text | Unverified | 0 |
| Findings of the Third Workshop on Automatic Simultaneous Translation | Jul 1, 2022 | Speech-to-Text, Translation | Unverified | 0 |
| Swiss German Speech to Text system evaluation | Jul 1, 2022 | Speech-to-Text | Unverified | 0 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | Jun 29, 2022 | Intent Classification, Slot Filling | Code Available | 0 |
| Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network | Jun 17, 2022 | speech-recognition, Speech Recognition | Unverified | 0 |
| Revisiting End-to-End Speech-to-Text Translation From Scratch | Jun 9, 2022 | Decoder, speech-recognition | Unverified | 0 |
| The Nós Project: Opening routes for the Galician language in the field of language technologies | Jun 1, 2022 | Cultural Vocal Bursts Intensity Prediction, Machine Translation | Unverified | 0 |
| Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool | Jun 1, 2022 | Sign Language Translation, Speech-to-Text | Unverified | 0 |
| A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking | Jun 1, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Clinical Dialogue Transcription Error Correction using Seq2Seq Models | May 26, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Semantic-preserved Communication System for Highly Efficient Speech Transmission | May 25, 2022 | Semantic Communication, speech-recognition | Unverified | 0 |
| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | May 20, 2022 | All, Automatic Speech Recognition (ASR) | Code Available | 6 |
| SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation | May 17, 2022 | Representation Learning, Retrieval | Unverified | 0 |
| Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language | May 6, 2022 | speech-recognition, Speech Recognition | Unverified | 0 |
| Cross-modal Contrastive Learning for Speech Translation | May 5, 2022 | Contrastive Learning, Retrieval | Code Available | 1 |
| Design of a novel Korean learning application for efficient pronunciation correction | May 4, 2022 | Sentence, speech-recognition | Unverified | 0 |
| Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages | May 2, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation | May 1, 2022 | Segmentation, Simultaneous Speech-to-Text Translation | Unverified | 0 |
| NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022 | May 1, 2022 | Segmentation, Simultaneous Speech-to-Text Translation | Unverified | 0 |
| The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation | May 1, 2022 | Chunking, Sentence | Unverified | 0 |
| The AISP-SJTU Simultaneous Translation System for IWSLT 2022 | May 1, 2022 | Speech-to-Text, Translation | Unverified | 0 |
| LibriS2S: A German-English Speech-to-Speech Translation Corpus | Apr 22, 2022 | Speech-to-Speech Translation, Speech-to-Text | Code Available | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | Apr 22, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation | Apr 6, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems | Apr 4, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | Apr 3, 2022 | speech-recognition, Speech Recognition | Unverified | 0 |
| The MIT Voice Name System | Mar 28, 2022 | Speech-to-Text | Unverified | 0 |
| A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Mar 27, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| XTREME-S: Evaluating Cross-lingual Speech Representations | Mar 21, 2022 | Representation Learning, Retrieval | Unverified | 0 |
| STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation | Mar 20, 2022 | Machine Translation, Speech-to-Text | Code Available | 1 |
| A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Mar 18, 2022 | Representation Learning, Speaker Verification | Code Available | 1 |
| A combined approach to the analysis of speech conversations in a contact center domain | Mar 12, 2022 | Speech-to-Text | Unverified | 0 |
| Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems | Mar 10, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Which French speech recognition system for assistant robots? | Mar 4, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Feb 21, 2022 | Data Augmentation, Phoneme Recognition | Code Available | 0 |
| Punctuation restoration in Swedish through fine-tuned KB-BERT | Feb 14, 2022 | Language Modelling, Punctuation Restoration | Unverified | 0 |
| Semantic-aware Speech to Text Transmission with Redundancy Removal | Feb 7, 2022 | Semantic Communication, Speech-to-Text | Unverified | 0 |
| Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility | Feb 5, 2022 | Speech Enhancement, Speech-to-Text | Unverified | 0 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | Sentence, Speech-to-Speech Translation | Code Available | 2 |
| A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture | Jan 6, 2022 | Speech-to-Text, text-to-speech | Code Available | 0 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Dec 23, 2021 | Benchmarking, Deep Learning | Code Available | 0 |
| Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement | Dec 21, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Cross-modal Contrastive Learning for Speech Translation | Dec 17, 2021 | Contrastive Learning, Retrieval | Unverified | 0 |
| X-Vector based voice activity detection for multi-genre broadcast speech-to-text | Dec 9, 2021 | Action Detection, Activity Detection | Code Available | 1 |