| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deepfake audio as a data augmentation technique for training automatic speech to text transcription models | Sep 22, 2023 | Data AugmentationFace Swapping | —Unverified | 0 |
| Deep Learning Based Natural Language Processing for End to End Speech Translation | Aug 9, 2018 | Speech-to-TextTranslation | —Unverified | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | Oct 24, 2020 | Cross-Lingual TransferDecoder | —Unverified | 0 |
| Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | Apr 3, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | Sep 27, 2023 | DecoderMachine Translation | —Unverified | 0 |
| Cross-modal Contrastive Learning for Speech Translation | Dec 17, 2021 | Contrastive LearningRetrieval | —Unverified | 0 |
| Design of a novel Korean learning application for efficient pronunciation correction | May 4, 2022 | Sentencespeech-recognition | —Unverified | 0 |
| Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions | Jun 30, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution | Sep 27, 2023 | Machine TranslationManagement | —Unverified | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | Nov 28, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum | Oct 18, 2024 | Speech-to-Text | —Unverified | 0 |
| Digits micro-model for accurate and secure transactions | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Direct Punjabi to English speech translation using discrete units | Feb 25, 2024 | Speech-to-Speech TranslationSpeech-to-Text | —Unverified | 0 |
| Crossing the SSH Bridge with Interview Data | May 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR | Jun 11, 2021 | Simultaneous Speech-to-Text TranslationSpeech-to-Text | —Unverified | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | Aug 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automated Testing of AI Models | Oct 7, 2021 | FairnessSpeech-to-Text | —Unverified | 0 |
| Analyzing ASR pretraining for low-resource speech-to-text translation | Oct 23, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Case Study on Filtering for End-to-End Speech Translation | Feb 2, 2024 | Speech-to-Speech TranslationSpeech-to-Text | —Unverified | 0 |
| Effectively pretraining a speech translation decoder with Machine Translation data | Nov 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Monotonic Multihead Attention | Dec 7, 2023 | Simultaneous Speech-to-Text TranslationSpeech-to-Text | —Unverified | 0 |
| Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center | Jan 31, 2021 | Graph Neural NetworkSpeech-to-Text | —Unverified | 0 |
| Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility | Dec 1, 2021 | Distant Speech RecognitionPosition | —Unverified | 0 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improve Sinhala Speech Recognition Through e2e LF-MMI Model | Dec 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Hands-Free VR | Feb 23, 2024 | DiversityLanguage Modelling | —Unverified | 0 |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving | Jun 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AudioPaLM: A Large Language Model That Can Speak and Listen | Jun 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Nov 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Corpus Creation and Evaluation for Speech-to-Text and Speech Translation | Aug 1, 2021 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| A low latency ASR-free end to end spoken language understanding system | Nov 10, 2020 | Speech-to-TextSpoken Language Understanding | —Unverified | 0 |
| Conversational Recommendation System using NLP and Sentiment Analysis | May 17, 2025 | Conversational RecommendationDynamic Time Warping | —Unverified | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | Sep 21, 2021 | Audio ClassificationRetrieval | —Unverified | 0 |
| AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation | Mar 18, 2025 | DecoderSpeech-to-Text | —Unverified | 0 |
| Contextualized Spoken Word Representations from Convolutional Autoencoders | Jul 6, 2020 | Speech-to-TextWord Embeddings | —Unverified | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | Oct 24, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems | Oct 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advancing STT for Low-Resource Real-World Speech | Jun 10, 2025 | SentenceSpeech-to-Text | —Unverified | 0 |
| Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | Mar 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language | May 6, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Open Brain AI. Automatic Language Assessment | Jun 11, 2023 | Speech-to-Text | —Unverified | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | Feb 4, 2021 | Adversarial AttackSpeech-to-Text | —Unverified | 0 |
| Comparison of SVD and factorized TDNN approaches for speech to text | Oct 13, 2021 | Speech-to-Text | —Unverified | 0 |
| Compact Speech Translation Models via Discrete Speech Units Pretraining | Feb 29, 2024 | DecoderSelf-Supervised Learning | —Unverified | 0 |
| Acquisition of high-quality images for camera calibration in robotics applications via speech prompts | Apr 15, 2025 | Camera CalibrationSpeech-to-Text | —Unverified | 0 |
| Findings of the Third Workshop on Automatic Simultaneous Translation | Jul 1, 2022 | Speech-to-TextTranslation | —Unverified | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | Jan 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Findings of the Second Workshop on Automatic Simultaneous Translation | Jun 1, 2021 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| Fast Labeling and Transcription with the Speechalyzer Toolkit | May 1, 2012 | Audio ClassificationBenchmarking | —Unverified | 0 |