| The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR | Mar 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| VALLR: Visual ASR Language Model for Lip Reading | Mar 27, 2025 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| FinAudio: A Benchmark for Audio Large Language Models in Financial Applications | Mar 26, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages | Mar 26, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 4 |
| Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization | Mar 25, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Whispering in Amharic: Fine-tuning Whisper for Low-resource Language | Mar 24, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Your voice is your voice: Supporting Self-expression through Speech Generation and LLMs in Augmented and Alternative Communication | Mar 21, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces | Mar 19, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Halving transcription time: A fast, user-friendly and GDPR-compliant workflow to create AI-assisted transcripts for content analysis | Mar 17, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Enhancing Aviation Communication Transcription: Fine-Tuning Distil-Whisper with LoRA | Mar 13, 2025 | Automatic Speech Recognition, parameter-efficient fine-tuning | Unverified | 0 |
| ValSub: Subsampling Validation Data to Mitigate Forgetting during ASR Personalization | Mar 12, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Everything Can Be Described in Words: A Simple Unified Multi-Modal Framework with Semantic and Temporal Alignment | Mar 12, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR | Mar 11, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling | Mar 10, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Building English ASR model with regional language support | Mar 10, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment | Mar 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Qieemo: Speech Is All You Need in the Emotion Recognition in Conversations | Mar 5, 2025 | All, Automatic Speech Recognition | Unverified | 0 |
| Direct Speech to Speech Translation: A Review | Mar 3, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Fine-Tuning Whisper for Inclusive Prosodic Stress Analysis | Mar 3, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems | Mar 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation | Feb 27, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications | Feb 27, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR | Feb 27, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | Feb 26, 2025 | Audio Synthesis, Automatic Speech Recognition | Unverified | 0 |
| CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition | Feb 26, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |