| Improving Metrics for Speech Translation | May 22, 2023 | Speech-to-TextTranslation | —Unverified | 0 |
| Application-Agnostic Language Modeling for On-Device ASR | May 16, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks | May 4, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Autoregressive NLP Tasks via Modular Linearized Attention | Apr 17, 2023 | Computational EfficiencyMachine Translation | —Unverified | 0 |
| Enhancing Speech-to-Speech Translation with Multiple TTS Targets | Apr 10, 2023 | Speech-to-Speech TranslationSpeech-to-Text | —Unverified | 0 |
| ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit | Apr 10, 2023 | BenchmarkingSimultaneous Speech-to-Text Translation | CodeCode Available | 0 |
| Natural Language Robot Programming: NLP integrated with autonomous robotic grasping | Apr 6, 2023 | Robotic GraspingSpeech-to-Text | —Unverified | 0 |
| Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R | Mar 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts | Mar 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | Mar 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction | Feb 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Characterizing Financial Market Coverage using Artificial Intelligence | Feb 7, 2023 | Speech-to-Text | —Unverified | 0 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | Jan 6, 2023 | Domain AdaptationGPU | —Unverified | 0 |
| Pushing the performances of ASR models on English and Spanish accents | Dec 22, 2022 | Speech-to-Text | —Unverified | 0 |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Dec 19, 2022 | Contrastive LearningSpeech-to-Text | CodeCode Available | 0 |
| M3ST: Mix at Three Levels for Speech Translation | Dec 7, 2022 | Data AugmentationDiversity | —Unverified | 0 |
| MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition | Nov 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | Nov 28, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search | Oct 31, 2022 | Emotion RecognitionNeural Architecture Search | —Unverified | 0 |
| Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | Oct 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Speech Translation with Dynamic Latent Perceivers | Oct 28, 2022 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 |
| Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation | Oct 24, 2022 | SegmentationSpeech-to-Text | CodeCode Available | 0 |
| Named Entity Detection and Injection for Direct Speech Translation | Oct 21, 2022 | SentenceSpeech-to-Text | —Unverified | 0 |
| Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses | Oct 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Simple and Effective Unsupervised Speech Translation | Oct 18, 2022 | Domain AdaptationMachine Translation | —Unverified | 0 |
| Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | Oct 13, 2022 | Generative Adversarial NetworkSpeaker anonymization | CodeCode Available | 0 |
| CTC Alignments Improve Autoregressive Translation | Oct 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training | Oct 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Speech-to-Text and Evaluation of Multiple Machine Translation Systems | Sep 1, 2022 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks | Aug 25, 2022 | Machine TranslationPart-Of-Speech Tagging | —Unverified | 0 |
| Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech | Aug 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Extending RNN-T-based speech recognition systems with emotion and language classification | Jul 28, 2022 | Emotion ClassificationEmotion Recognition | —Unverified | 0 |
| RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks | Jul 14, 2022 | Speech-to-Text | —Unverified | 0 |
| M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation | Jul 3, 2022 | DecoderSpeech-to-Text | CodeCode Available | 0 |
| System Description on Automatic Simultaneous Translation Workshop | Jul 1, 2022 | SentenceSpeech-to-Text | —Unverified | 0 |
| Swiss German Speech to Text system evaluation | Jul 1, 2022 | Speech-to-Text | —Unverified | 0 |
| Findings of the Third Workshop on Automatic Simultaneous Translation | Jul 1, 2022 | Speech-to-TextTranslation | —Unverified | 0 |
| Language Model Augmented Monotonic Attention for Simultaneous Translation | Jul 1, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | Jun 29, 2022 | Intent ClassificationSlot Filling | CodeCode Available | 0 |
| Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network | Jun 17, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Revisiting End-to-End Speech-to-Text Translation From Scratch | Jun 9, 2022 | Decoderspeech-recognition | CodeCode Available | 0 |
| Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool | Jun 1, 2022 | Sign Language TranslationSpeech-to-Text | —Unverified | 0 |
| A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking | Jun 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The Nós Project: Opening routes for the Galician language in the field of language technologies | Jun 1, 2022 | Cultural Vocal Bursts Intensity PredictionMachine Translation | —Unverified | 0 |
| Clinical Dialogue Transcription Error Correction using Seq2Seq Models | May 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Semantic-preserved Communication System for Highly Efficient Speech Transmission | May 25, 2022 | Semantic Communicationspeech-recognition | —Unverified | 0 |
| SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation | May 17, 2022 | Representation LearningRetrieval | —Unverified | 0 |
| Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language | May 6, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Design of a novel Korean learning application for efficient pronunciation correction | May 4, 2022 | Sentencespeech-recognition | —Unverified | 0 |