| Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation | Dec 6, 2019 | FormMachine Translation | CodeCode Available | 0 | 5 |
| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Sep 13, 2024 | In-Context LearningRetrieval | CodeCode Available | 0 | 5 |
| fairseq S2T: Fast Speech-to-Text Modeling with fairseq | Oct 11, 2020 | Machine TranslationMulti-Task Learning | CodeCode Available | 0 | 5 |
| M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation | Jul 3, 2022 | DecoderSpeech-to-Text | CodeCode Available | 0 | 5 |
| Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN | Jul 24, 2023 | Automatic Speech RecognitionSentiment Analysis | CodeCode Available | 0 | 5 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Feb 19, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 | 5 |
| Pre-training on high-resource speech recognition improves low-resource speech-to-text translation | Sep 5, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| LibriS2S: A German-English Speech-to-Speech Translation Corpus | Apr 22, 2022 | Speech-to-Speech TranslationSpeech-to-Text | CodeCode Available | 0 | 5 |
| Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models | Jul 9, 2024 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 | 5 |
| Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning | Sep 21, 2016 | DecoderMulti-Task Learning | CodeCode Available | 0 | 5 |
| Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision | Dec 30, 2023 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 | 5 |
| Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Nov 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation | Dec 6, 2016 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 | 5 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Dec 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech RecognitionClassification | CodeCode Available | 0 | 5 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | Jun 29, 2022 | Intent ClassificationSlot Filling | CodeCode Available | 0 | 5 |
| FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild | Jan 8, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 | 5 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Dec 23, 2021 | BenchmarkingDeep Learning | CodeCode Available | 0 | 5 |
| Scribosermo: Fast Speech-to-Text models for German and other Languages | Oct 15, 2021 | Speech RecognitionSpeech-to-Text | CodeCode Available | 0 | 5 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Efficient Monotonic Multihead Attention | Dec 7, 2023 | Simultaneous Speech-to-Text TranslationSpeech-to-Text | —Unverified | 0 | 0 |
| Effectively pretraining a speech translation decoder with Machine Translation data | Nov 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | Jun 11, 2024 | Contrastive LearningSpeech Synthesis | —Unverified | 0 | 0 |
| Application of Audio Fingerprinting Techniques for Real-Time Scalable Speech Retrieval and Speech Clusterization | Oct 29, 2024 | GPURetrieval | —Unverified | 0 | 0 |
| A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | Aug 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR | Jun 11, 2021 | Simultaneous Speech-to-Text TranslationSpeech-to-Text | —Unverified | 0 | 0 |
| Application-Agnostic Language Modeling for On-Device ASR | May 16, 2023 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 | 0 |
| Direct Punjabi to English speech translation using discrete units | Feb 25, 2024 | Speech-to-Speech TranslationSpeech-to-Text | —Unverified | 0 | 0 |
| Digits micro-model for accurate and secure transactions | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Bridging the Modality Gap for Speech-to-Text Translation | Oct 28, 2020 | DecoderSpeech-to-Text | —Unverified | 0 | 0 |
| Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | Oct 18, 2024 | Speech-to-Text | —Unverified | 0 | 0 |
| Development of Natural Language Processing Tools for Cook Islands M\=aori | Dec 1, 2018 | Machine TranslationPart-Of-Speech Tagging | —Unverified | 0 | 0 |
| Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models | Apr 25, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | Oct 13, 2022 | Generative Adversarial NetworkSpeaker anonymization | —Unverified | 0 | 0 |
| AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR | Sep 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Comparative Study on End-to-end Speech to Text Translation | Nov 20, 2019 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |
| Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution | Sep 27, 2023 | Machine TranslationManagement | —Unverified | 0 | 0 |
| Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network | Jun 17, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Design of a novel Korean learning application for efficient pronunciation correction | May 4, 2022 | Sentencespeech-recognition | —Unverified | 0 | 0 |
| Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | Apr 3, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | Jan 29, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Deep Learning Based Natural Language Processing for End to End Speech Translation | Aug 9, 2018 | Speech-to-TextTranslation | —Unverified | 0 | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | Feb 24, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 | 0 |
| An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting | Dec 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Adversarial Attacks against Neural Networks in Audio Domain: Exploiting Principal Components | Jul 14, 2020 | ClassificationGeneral Classification | —Unverified | 0 | 0 |
| Deepfake audio as a data augmentation technique for training automatic speech to text transcription models | Sep 22, 2023 | Data AugmentationFace Swapping | —Unverified | 0 | 0 |
| DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems | Dec 13, 2018 | Deep LearningSpeech-to-Text | —Unverified | 0 | 0 |
| Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems | Oct 13, 2021 | SentenceSimultaneous Speech-to-Text Translation | —Unverified | 0 | 0 |
| Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning | Nov 11, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |