| SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization | Jun 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization | Jun 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Streaming Speech-to-Confusion Network Speech Recognition | Jun 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Can Contextual Biasing Remain Effective with Whisper and GPT-2? | Jun 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Improved DeepFake Detection Using Whisper Features | Jun 2, 2023 | Automatic Speech RecognitionDeepFake Detection | CodeCode Available | 1 |
| Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation | Jun 2, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Audio-Visual Speech Enhancement with Score-Based Generative Models | Jun 2, 2023 | Automatic Speech RecognitionLipreading | —Unverified | 0 |
| Some voices are too common: Building fair speech recognition systems using the Common Voice dataset | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Encoder-decoder multimodal speaker change detection | Jun 1, 2023 | Automatic Speech RecognitionChange Detection | —Unverified | 0 |
| Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adaptation and Optimization of Automatic Speech Recognition (ASR) for the Maritime Domain in the Field of VHF Communication | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AfriNames: Most ASR models "butcher" African Names | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SlothSpeech: Denial-of-service Attack Against Speech Recognition Models | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning | May 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Strategies for improving low resource speech to text translation relying on pre-trained ASR models | May 31, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Accurate and Structured Pruning for Efficient Automatic Speech Recognition | May 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition | May 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Zero-Shot Automatic Pronunciation Assessment | May 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Selection of Text-to-speech Data to Augment ASR Training | May 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adapting Multi-Lingual ASR Models for Handling Multiple Talkers | May 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | May 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions | May 30, 2023 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Building Accurate Low Latency ASR for Streaming Voice Search | May 29, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation | May 29, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment | May 29, 2023 | Automatic Speech RecognitionMulti-Task Learning | —Unverified | 0 |
| Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | May 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target | May 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice | May 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| 2-bit Conformer quantization for automatic speech recognition | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Scheduled Sampling for Neural Transducer-based ASR | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Mixture-of-Expert Conformer for Streaming Multilingual ASR | May 25, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Svarah: Evaluating English ASR Systems on Indian Accents | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition | May 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Textless Speech-to-Speech Translation With Limited Parallel Data | May 24, 2023 | Automatic Speech RecognitionDenoising | CodeCode Available | 0 |
| Iteratively Improving Speech Recognition and Voice Conversion | May 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation | May 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Personalized Predictive ASR for Latency Reduction in Voice Assistants | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers | May 23, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |