Evaluation of real-time transcriptions using end-to-end ASR models Sep 9, 2024 Action Detection Activity Detection
— Unverified 0Consensus-based Distributed Quantum Kernel Learning for Speech Recognition Sep 9, 2024 Computational Efficiency Emotion Recognition
— Unverified 0Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge Sep 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR Sep 9, 2024 Automatic Speech Recognition speaker-diarization
— Unverified 0NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge Sep 9, 2024 Action Detection Activity Detection
— Unverified 0Retrieval Augmented Correction of Named Entity Speech Recognition Errors Sep 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation Sep 9, 2024 speech-recognition Speech Recognition
— Unverified 0An investigation of modularity for noise robustness in conformer-based ASR Sep 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection Sep 8, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lightweight Transducer Based on Frame-Level Criterion Sep 5, 2024 Decoder imbalanced classification
Code Code Available 0What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations Sep 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models Sep 4, 2024 Decoder Noisy Speech Recognition
— Unverified 0Probing self-attention in self-supervised speech models for cross-linguistic differences Sep 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Quantification of stylistic differences in human- and ASR-produced transcripts of African American English Sep 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Code-Switching Speech Recognition with LID-Based Collaborative Mixture of Experts Model Sep 3, 2024 Language Identification Mixture-of-Experts
— Unverified 0VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka Sep 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge Sep 3, 2024 speech-recognition Speech Recognition
— Unverified 0Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR Sep 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Reassessing Noise Augmentation Methods in the Context of Adversarial Speech Sep 3, 2024 Adversarial Robustness Automatic Speech Recognition
— Unverified 0A Framework for Synthetic Audio Conversations Generation using Large Language Models Sep 2, 2024 Audio Classification Audio Tagging
— Unverified 0Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR Sep 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition Sep 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Comparing Discrete and Continuous Space LLMs for Speech Recognition Sep 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Progressive Residual Extraction based Pre-training for Speech Representation Learning Aug 31, 2024 Emotion Recognition Representation Learning
— Unverified 0DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module Aug 31, 2024 Audio-Visual Speech Recognition speech-recognition
— Unverified 0ProGRes: Prompted Generative Rescoring on ASR n-Best Aug 30, 2024 speech-recognition Speech Recognition
Code Code Available 0Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder Aug 30, 2024 Automatic Speech Recognition Diagnostic
— Unverified 0Advancing Multi-talker ASR Performance with Large Language Models Aug 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speaker Tagging Correction With Non-Autoregressive Language Models Aug 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction Aug 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Measuring the Accuracy of Automatic Speech Recognition Solutions Aug 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation Aug 29, 2024 speech-recognition Speech Recognition
— Unverified 0Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications Aug 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech Recognition Transformers: Topological-lingualism Perspective Aug 27, 2024 speech-recognition Speech Recognition
— Unverified 0Literary and Colloquial Dialect Identification for Tamil using Acoustic Features Aug 27, 2024 Automatic Speech Recognition Dialect Identification
— Unverified 0Self-supervised Speech Representations Still Struggle with African American Vernacular English Aug 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Automatic recognition and detection of aphasic natural speech Aug 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Research Advances and New Paradigms for Biology-inspired Spiking Neural Networks Aug 26, 2024 Automatic Speech Recognition Brain Computer Interface
— Unverified 0MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues Aug 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Literary and Colloquial Tamil Dialect Identification Aug 25, 2024 Dialect Identification speech-recognition
— Unverified 0Studying the Effect of Audio Filters in Pre-Trained Models for Environmental Sound Classification Aug 24, 2024 Classification Environmental Sound Classification
— Unverified 0Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models Aug 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features Aug 22, 2024 Automatic Speech Recognition Self-Supervised Learning
— Unverified 0Positional Description for Numerical Normalization Aug 22, 2024 speech-recognition Speech Recognition
— Unverified 0Towards measuring fairness in speech recognition: Fair-Speech dataset Aug 22, 2024 Fairness speech-recognition
— Unverified 0Improving Speech Recognition Error Prediction for Modern and Off-the-shelf Speech Recognizers Aug 21, 2024 Language Modeling Language Modelling
— Unverified 0The State of Commercial Automatic French Legal Speech Recognition Systems and their Impact on Court Reporters et al Aug 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition Aug 20, 2024 speech-recognition Speech Recognition
— Unverified 0Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts Aug 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition Aug 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0