| Attention-Based End-to-End Speech Recognition on Voice Search | Jul 22, 2017 | DecoderL2 Regularization | —Unverified | 0 | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | Feb 4, 2021 | Adversarial AttackSpeech-to-Text | —Unverified | 0 | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | Sep 21, 2021 | Audio ClassificationRetrieval | —Unverified | 0 | 0 |
| AudioPaLM: A Large Language Model That Can Speak and Listen | Jun 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Automated Testing of AI Models | Oct 7, 2021 | FairnessSpeech-to-Text | —Unverified | 0 | 0 |
| A Voice Controlled E-Commerce Web Application | Nov 16, 2018 | Medical Diagnosisspeech-recognition | —Unverified | 0 | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | Feb 24, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | Jan 29, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models | Apr 25, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Bridging the Modality Gap for Speech-to-Text Translation | Oct 28, 2020 | DecoderSpeech-to-Text | —Unverified | 0 | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | Aug 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | Jun 11, 2024 | Contrastive LearningSpeech Synthesis | —Unverified | 0 | 0 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Characterizing Financial Market Coverage using Artificial Intelligence | Feb 7, 2023 | Speech-to-Text | —Unverified | 0 | 0 |
| CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training | May 27, 2023 | intent-classificationIntent Classification | —Unverified | 0 | 0 |
| Class-Conditional Defense GAN Against End-to-End Speech Attacks | Oct 22, 2020 | Generative Adversarial NetworkSentence | —Unverified | 0 | 0 |
| Cross-lingual topic prediction for speech using translations | Aug 29, 2019 | HumanitarianPrediction | —Unverified | 0 | 0 |
| Clinical Dialogue Transcription Error Correction using Seq2Seq Models | May 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Cloud-Based Face and Speech Recognition for Access Control Applications | Apr 23, 2020 | Face Recognitionspeech-recognition | —Unverified | 0 | 0 |
| CMU's IWSLT 2024 Simultaneous Speech Translation System | Aug 14, 2024 | DecoderSpeech-to-Text | —Unverified | 0 | 0 |
| CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders | Sep 14, 2023 | Contrastive LearningKnowledge Distillation | —Unverified | 0 | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | Jan 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Compact Speech Translation Models via Discrete Speech Units Pretraining | Feb 29, 2024 | DecoderSelf-Supervised Learning | —Unverified | 0 | 0 |
| Comparison of SVD and factorized TDNN approaches for speech to text | Oct 13, 2021 | Speech-to-Text | —Unverified | 0 | 0 |
| Open Brain AI. Automatic Language Assessment | Jun 11, 2023 | Speech-to-Text | —Unverified | 0 | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | Oct 24, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Contextualized Spoken Word Representations from Convolutional Autoencoders | Jul 6, 2020 | Speech-to-TextWord Embeddings | —Unverified | 0 | 0 |
| Conversational Recommendation System using NLP and Sentiment Analysis | May 17, 2025 | Conversational RecommendationDynamic Time Warping | —Unverified | 0 | 0 |
| Corpus Creation and Evaluation for Speech-to-Text and Speech Translation | Aug 1, 2021 | Machine TranslationSpeech-to-Text | —Unverified | 0 | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Nov 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | Jun 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Crossing the SSH Bridge with Interview Data | May 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Cross-modal Contrastive Learning for Speech Translation | Dec 17, 2021 | Contrastive LearningRetrieval | —Unverified | 0 | 0 |
| Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | Sep 27, 2023 | DecoderMachine Translation | —Unverified | 0 | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | Oct 24, 2020 | Cross-Lingual TransferDecoder | —Unverified | 0 | 0 |
| CTC Alignments Improve Autoregressive Translation | Oct 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR | Nov 7, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| DARTS: Dialectal Arabic Transcription System | Sep 26, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning | Nov 11, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems | Oct 13, 2021 | SentenceSimultaneous Speech-to-Text Translation | —Unverified | 0 | 0 |
| DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems | Dec 13, 2018 | Deep LearningSpeech-to-Text | —Unverified | 0 | 0 |
| Deepfake audio as a data augmentation technique for training automatic speech to text transcription models | Sep 22, 2023 | Data AugmentationFace Swapping | —Unverified | 0 | 0 |
| Deep Learning Based Natural Language Processing for End to End Speech Translation | Aug 9, 2018 | Speech-to-TextTranslation | —Unverified | 0 | 0 |
| Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | Apr 3, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Design of a novel Korean learning application for efficient pronunciation correction | May 4, 2022 | Sentencespeech-recognition | —Unverified | 0 | 0 |
| Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network | Jun 17, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution | Sep 27, 2023 | Machine TranslationManagement | —Unverified | 0 | 0 |
| Development of Natural Language Processing Tools for Cook Islands M\=aori | Dec 1, 2018 | Machine TranslationPart-Of-Speech Tagging | —Unverified | 0 | 0 |
| Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | Oct 18, 2024 | Speech-to-Text | —Unverified | 0 | 0 |
| Digits micro-model for accurate and secure transactions | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |