| End-to-End Automatic Speech Translation of Audiobooks | Feb 12, 2018 | automatic-speech-translation, Speech-to-Text | Code Available | 0 |
| Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation | Feb 9, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| End to End ASR System with Automatic Punctuation Insertion | Dec 3, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN | Jul 24, 2023 | Automatic Speech Recognition, Sentiment Analysis | Code Available | 0 |
| Audio Adversarial Examples: Targeted Attacks on Speech-to-Text | Jan 5, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Efficient Speech Translation with Dynamic Latent Perceivers | Oct 28, 2022 | Speech-to-Text, Speech-to-Text Translation | Code Available | 0 |
| Careless Whisper: Speech-to-Text Hallucination Harms | Feb 12, 2024 | Hallucination, Language Modeling | Code Available | 0 |
| Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation | Dec 6, 2019 | Form, Machine Translation | Code Available | 0 |
| Towards End-to-end Speech-to-text Summarization | Jun 6, 2023 | Abstractive Text Summarization, Speech-to-Text | Code Available | 0 |
| Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation | Oct 24, 2022 | Segmentation, Speech-to-Text | Code Available | 0 |
| Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin | Oct 21, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding | Dec 16, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Mar 27, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Dec 19, 2022 | Contrastive Learning, Speech-to-Text | Code Available | 0 |
| Transformer-Based Named Entity Recognition for Automated Server Provisioning | Apr 1, 2025 | named-entity-recognition, Named Entity Recognition | Code Available | 0 |
| Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features | Sep 20, 2016 | Speech-to-Text, Translation | Code Available | 0 |
| Attentively Embracing Noise for Robust Latent Representation in BERT | Dec 1, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training | Oct 7, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Scribosermo: Fast Speech-to-Text models for German and other Languages | Oct 15, 2021 | Speech Recognition, Speech-to-Text | Code Available | 0 |
| FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild | Jan 8, 2024 | Language Modelling, Large Language Model | Code Available | 0 |
| Direct speech-to-speech translation with a sequence-to-sequence model | Apr 12, 2019 | Speech Synthesis, Speech-to-Speech Translation | Code Available | 0 |
| mask-Net: Learning Context Aware Invariant Features using Adversarial Forgetting (Student Abstract) | Nov 25, 2020 | Speech-to-Text | Code Available | 0 |
| An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation | Aug 28, 2023 | Machine Translation, NMT | Code Available | 0 |
| SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition | Apr 5, 2021 | speech-recognition, Speech Recognition | Code Available | 0 |
| Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions | Feb 13, 2018 | BIG-bench Machine Learning, Management | Code Available | 0 |