| Comparing Discrete and Continuous Space LLMs for Speech Recognition | Sep 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition | Sep 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Speaker Tagging Correction With Non-Autoregressive Language Models | Aug 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder | Aug 30, 2024 | Automatic Speech Recognition, Diagnostic | Unverified | 0 |
| Advancing Multi-talker ASR Performance with Large Language Models | Aug 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | Aug 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Measuring the Accuracy of Automatic Speech Recognition Solutions | Aug 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications | Aug 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Literary and Colloquial Dialect Identification for Tamil using Acoustic Features | Aug 27, 2024 | Automatic Speech Recognition, Dialect Identification | Unverified | 0 |
| Automatic recognition and detection of aphasic natural speech | Aug 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Research Advances and New Paradigms for Biology-inspired Spiking Neural Networks | Aug 26, 2024 | Automatic Speech Recognition, Brain Computer Interface | Unverified | 0 |
| Self-supervised Speech Representations Still Struggle with African American Vernacular English | Aug 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues | Aug 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models | Aug 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Aug 22, 2024 | Automatic Speech Recognition, Self-Supervised Learning | Unverified | 0 |
| The State of Commercial Automatic French Legal Speech Recognition Systems and their Impact on Court Reporters et al | Aug 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition | Aug 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts | Aug 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words | Aug 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Aug 14, 2024 | Automatic Speech Recognition, Speech Enhancement | Unverified | 0 |
| SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition | Aug 14, 2024 | Automatic Speech Recognition, Benchmarking | Code Available | 1 |
| Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation | Aug 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning | Aug 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance | Aug 12, 2024 | Acoustic Scene Classification, Automatic Speech Recognition | Unverified | 0 |
| LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition | Aug 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing | Aug 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text | Aug 10, 2024 | Automatic Speech Recognition, Hallucination | Unverified | 0 |
| MooER: LLM-based Speech Recognition and Translation Models from Moore Threads | Aug 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| Preserving spoken content in voice anonymisation with character-level vocoder conditioning | Aug 8, 2024 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| HydraFormer: One Encoder For All Subsampling Rates | Aug 8, 2024 | All, Automatic Speech Recognition | Code Available | 0 |
| wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech | Aug 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability | Aug 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval | Aug 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-Supervised Learning for Multi-Channel Neural Transducer | Aug 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion | Aug 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features | Aug 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | Aug 1, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | Aug 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Towards interfacing large language models with ASR systems using confidence measures and prompting | Jul 31, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition | Jul 31, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition | Jul 30, 2024 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses | Jul 26, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures | Jul 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Scaling A Simple Approach to Zero-Shot Speech Recognition | Jul 25, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions | Jul 25, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Sentiment Reasoning for Healthcare | Jul 24, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives | Jul 24, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization | Jul 23, 2024 | Automatic Speech Recognition, Distant Speech Recognition | Unverified | 0 |
| Quantifying the Role of Textual Predictability in Automatic Speech Recognition | Jul 23, 2024 | Attribute, Automatic Speech Recognition | Unverified | 0 |
| Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction | Jul 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |