Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR | Sep 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka | Sep 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR | Sep 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
A Framework for Synthetic Audio Conversations Generation using Large Language Models | Sep 2, 2024 | Audio Classification, Audio Tagging | Unverified | 0
Comparing Discrete and Continuous Space LLMs for Speech Recognition | Sep 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition | Sep 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Progressive Residual Extraction based Pre-training for Speech Representation Learning | Aug 31, 2024 | Emotion Recognition, Representation Learning | Unverified | 0
DCIM-AVSR: Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module | Aug 31, 2024 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0
Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder | Aug 30, 2024 | Automatic Speech Recognition, Diagnostic | Unverified | 0
Speaker Tagging Correction With Non-Autoregressive Language Models | Aug 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
ProGRes: Prompted Generative Rescoring on ASR n-Best | Aug 30, 2024 | speech-recognition, Speech Recognition | Code Available | 0
Advancing Multi-talker ASR Performance with Large Language Models | Aug 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Measuring the Accuracy of Automatic Speech Recognition Solutions | Aug 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation | Aug 29, 2024 | speech-recognition, Speech Recognition | Unverified | 0
CrisperWhisper: Accurate Timestamps on Verbatim Speech Transcriptions | Aug 29, 2024 | Dynamic Time Warping, speech-recognition | Code Available | 4
Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | Aug 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications | Aug 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Speech Recognition Transformers: Topological-lingualism Perspective | Aug 27, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Literary and Colloquial Dialect Identification for Tamil using Acoustic Features | Aug 27, 2024 | Automatic Speech Recognition, Dialect Identification | Unverified | 0
Automatic recognition and detection of aphasic natural speech | Aug 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Self-supervised Speech Representations Still Struggle with African American Vernacular English | Aug 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Research Advances and New Paradigms for Biology-inspired Spiking Neural Networks | Aug 26, 2024 | Automatic Speech Recognition, Brain Computer Interface | Unverified | 0
MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues | Aug 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Literary and Colloquial Tamil Dialect Identification | Aug 25, 2024 | Dialect Identification, speech-recognition | Unverified | 0
Studying the Effect of Audio Filters in Pre-Trained Models for Environmental Sound Classification | Aug 24, 2024 | Classification, Environmental Sound Classification | Unverified | 0
Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models | Aug 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Aug 22, 2024 | Automatic Speech Recognition, Self-Supervised Learning | Unverified | 0
Positional Description for Numerical Normalization | Aug 22, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Towards measuring fairness in speech recognition: Fair-Speech dataset | Aug 22, 2024 | Fairness, speech-recognition | Unverified | 0
The State of Commercial Automatic French Legal Speech Recognition Systems and their Impact on Court Reporters et al | Aug 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Improving Speech Recognition Error Prediction for Modern and Off-the-shelf Speech Recognizers | Aug 21, 2024 | Language Modeling, Language Modelling | Unverified | 0
Approaching Deep Learning through the Spectral Dynamics of Weights | Aug 21, 2024 | Deep Learning, image-classification | Code Available | 1
XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition | Aug 20, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Toward Large-scale Spiking Neural Networks: A Comprehensive Survey and Future Directions | Aug 19, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition | Aug 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts | Aug 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition | Aug 17, 2024 | Language Modeling, Language Modelling | Code Available | 0
Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words | Aug 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Aug 14, 2024 | Automatic Speech Recognition, Speech Enhancement | Unverified | 0
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition | Aug 14, 2024 | Automatic Speech Recognition, Benchmarking | Code Available | 1
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation | Aug 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Cross-Lingual Conversational Speech Summarization with Large Language Models | Aug 12, 2024 | Machine Translation, speech-recognition | Unverified | 0
Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning | Aug 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance | Aug 12, 2024 | Acoustic Scene Classification, Automatic Speech Recognition | Unverified | 0
LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition | Aug 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing | Aug 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text | Aug 10, 2024 | Automatic Speech Recognition, Hallucination | Unverified | 0
MooER: LLM-based Speech Recognition and Translation Models from Moore Threads | Aug 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3
Preserving spoken content in voice anonymisation with character-level vocoder conditioning | Aug 8, 2024 | Automatic Speech Recognition, speech-recognition | Code Available | 0
HydraFormer: One Encoder For All Subsampling Rates | Aug 8, 2024 | All, Automatic Speech Recognition | Code Available | 0