| DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Aug 14, 2024 | Automatic Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation | Aug 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning | Aug 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance | Aug 12, 2024 | Acoustic Scene ClassificationAutomatic Speech Recognition | —Unverified | 0 |
| VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing | Aug 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text | Aug 10, 2024 | Automatic Speech RecognitionHallucination | —Unverified | 0 |
| Preserving spoken content in voice anonymisation with character-level vocoder conditioning | Aug 8, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| HydraFormer: One Encoder For All Subsampling Rates | Aug 8, 2024 | AllAutomatic Speech Recognition | CodeCode Available | 0 |
| MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability | Aug 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval | Aug 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-Supervised Learning for Multi-Channel Neural Transducer | Aug 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion | Aug 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | Aug 1, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | Aug 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition | Jul 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards interfacing large language models with ASR systems using confidence measures and prompting | Jul 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition | Jul 30, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses | Jul 26, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Scaling A Simple Approach to Zero-Shot Speech Recognition | Jul 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures | Jul 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions | Jul 25, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives | Jul 24, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization | Jul 23, 2024 | Automatic Speech RecognitionDistant Speech Recognition | —Unverified | 0 |
| Quantifying the Role of Textual Predictability in Automatic Speech Recognition | Jul 23, 2024 | AttributeAutomatic Speech Recognition | —Unverified | 0 |
| Trading Devil Final: Backdoor attack via Stock market and Bayesian Optimization | Jul 21, 2024 | Automatic Speech RecognitionBackdoor Attack | —Unverified | 0 |
| Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance | Jul 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Handling Numeric Expressions in Automatic Speech Recognition | Jul 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Robust ASR Error Correction with Conservative Data Filtering | Jul 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A light-weight and efficient punctuation and word casing prediction model for on-device streaming ASR | Jul 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training | Jul 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Morphosyntactic Analysis for CHILDES | Jul 17, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Beyond Binary: Multiclass Paraphasia Detection with Generative Pretrained Transformers and End-to-End Models | Jul 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation | Jul 16, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data | Jul 15, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Textless Dependency Parsing by Labeled Sequence Prediction | Jul 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation | Jul 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Text-Based Detection of On-Hold Scripts in Contact Center Calls | Jul 13, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing | Jul 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation | Jul 8, 2024 | Automatic Speech RecognitionEmotion Recognition | —Unverified | 0 |
| Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation | Jul 8, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Written Term Detection Improves Spoken Term Detection | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| LearnerVoice: A Dataset of Non-Native English Learners' Spontaneous Speech | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models | Jul 5, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Romanization Encoding For Multilingual ASR | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Convformer: Extending Conformer with Multiple Convolution Kernels | Jul 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis | Jul 4, 2024 | Accented Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |