| Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition | Sep 15, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ASR Error Correction using Large Language Models | Sep 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? | Sep 13, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring SSL Discrete Tokens for Multilingual ASR | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language | Sep 12, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models | Sep 12, 2024 | Adversarial AttackAdversarial Purification | CodeCode Available | 0 |
| Full-text Error Correction for Chinese Speech Recognition with Large Language Model | Sep 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing CTC-Based Visual Speech Recognition | Sep 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition | Sep 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition | Sep 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking | Sep 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings | Sep 10, 2024 | Automatic Speech RecognitionDiversity | CodeCode Available | 0 |
| Retrieval Augmented Correction of Named Entity Speech Recognition Errors | Sep 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge | Sep 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR | Sep 9, 2024 | Automatic Speech Recognitionspeaker-diarization | —Unverified | 0 |
| NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge | Sep 9, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Evaluation of real-time transcriptions using end-to-end ASR models | Sep 9, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| An investigation of modularity for noise robustness in conformer-based ASR | Sep 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection | Sep 8, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Quantification of stylistic differences in human- and ASR-produced transcripts of African American English | Sep 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Sep 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Probing self-attention in self-supervised speech models for cross-linguistic differences | Sep 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Reassessing Noise Augmentation Methods in the Context of Adversarial Speech | Sep 3, 2024 | Adversarial RobustnessAutomatic Speech Recognition | —Unverified | 0 |
| Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR | Sep 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka | Sep 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR | Sep 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition | Sep 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Comparing Discrete and Continuous Space LLMs for Speech Recognition | Sep 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder | Aug 30, 2024 | Automatic Speech RecognitionDiagnostic | —Unverified | 0 |
| Speaker Tagging Correction With Non-Autoregressive Language Models | Aug 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advancing Multi-talker ASR Performance with Large Language Models | Aug 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Measuring the Accuracy of Automatic Speech Recognition Solutions | Aug 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | Aug 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications | Aug 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Literary and Colloquial Dialect Identification for Tamil using Acoustic Features | Aug 27, 2024 | Automatic Speech RecognitionDialect Identification | —Unverified | 0 |
| Research Advances and New Paradigms for Biology-inspired Spiking Neural Networks | Aug 26, 2024 | Automatic Speech RecognitionBrain Computer Interface | —Unverified | 0 |
| Automatic recognition and detection of aphasic natural speech | Aug 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues | Aug 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-supervised Speech Representations Still Struggle with African American Vernacular English | Aug 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models | Aug 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Aug 22, 2024 | Automatic Speech RecognitionSelf-Supervised Learning | —Unverified | 0 |
| The State of Commercial Automatic French Legal Speech Recognition Systems and their Impact on Court Reporters et al | Aug 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition | Aug 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts | Aug 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words | Aug 15, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |