| Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multimodal Speech Recognition for Language-Guided Embodied Agents | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A low latency attention module for streaming self-supervised speech representation learning | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Explanations for Automatic Speech Recognition | Feb 27, 2023 | Automatic Speech RecognitionExplainable Artificial Intelligence (XAI) | —Unverified | 0 |
| A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech Corpora Divergence Based Unsupervised Data Selection for ASR | Feb 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Ensemble for Multimodal Punctuation Restoration using Time-Delay Neural Network | Feb 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving Massively Multilingual ASR With Auxiliary CTC Objectives | Feb 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Factual Consistency Oriented Speech Recognition | Feb 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Ensemble knowledge distillation of self-supervised speech models | Feb 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Evaluating Automatic Speech Recognition in an Incremental Setting | Feb 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition | Feb 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| UML: A Universal Monolingual Output Layer for Multilingual ASR | Feb 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation | Feb 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Connecting Humanities and Social Sciences: Applying Language and Speech Technology to Online Panel Surveys | Feb 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An ASR-free Fluency Scoring Approach with Self-Supervised Learning | Feb 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition | Feb 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speaker and Language Change Detection using Wav2vec2 and Whisper | Feb 18, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Massively Multilingual Shallow Fusion with Large Language Models | Feb 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adaptive Axonal Delays in feedforward spiking neural networks for accurate spoken word recognition | Feb 16, 2023 | Audio ClassificationAutomatic Speech Recognition | —Unverified | 0 |
| Speaker Change Detection for Transformer Transducer ASR | Feb 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax | Feb 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Stabilising and accelerating light gated recurrent units for automatic speech recognition | Feb 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems | Feb 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| ASR Bundestag: A Large-Scale political debate dataset in German | Feb 12, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems | Feb 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction | Feb 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions | Feb 9, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MAC: A unified framework boosting low resource automatic speech recognition | Feb 5, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition | Feb 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Fillers in Spoken Language Understanding: Computational and Psycholinguistic Perspectives | Jan 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study | Jan 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Language Agnostic Data-Driven Inverse Text Normalization | Jan 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition | Jan 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Syllable Subword Tokens for Open Vocabulary Speech Recognition in Malayalam | Jan 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Multi-resolution location-based training for multi-channel continuous speech separation | Jan 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Using Kaldi for Automatic Speech Recognition of Conversational Austrian German | Jan 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition | Jan 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Streaming Punctuation: A Novel Punctuation Technique Leveraging Bidirectional Context for Continuous Speech Recognition | Jan 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project | Jan 1, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition | Dec 30, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation | Dec 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Alignment Entropy Regularization | Dec 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| 4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders | Dec 21, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Automatic Speech Recognition model for the Sudanese Dialect | Dec 21, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Mu^2SLAM: Multitask, Multilingual Speech and Language Models | Dec 19, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |