| Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | May 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment | May 29, 2023 | Automatic Speech RecognitionMulti-Task Learning | —Unverified | 0 |
| CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice | May 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | May 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Building Accurate Low Latency ASR for Streaming Voice Search | May 29, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target | May 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation | May 29, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| 2-bit Conformer quantization for automatic speech recognition | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Mixture-of-Expert Conformer for Streaming Multilingual ASR | May 25, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Scheduled Sampling for Neural Transducer-based ASR | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Svarah: Evaluating English ASR Systems on Indian Accents | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition | May 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Textless Speech-to-Speech Translation With Limited Parallel Data | May 24, 2023 | Automatic Speech RecognitionDenoising | CodeCode Available | 0 |
| Iteratively Improving Speech Recognition and Voice Conversion | May 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation | May 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers | May 23, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | May 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |