PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models (Jun 12, 2024). Tasks: Language Modeling.
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets (Jun 12, 2024). Tasks: Automatic Speech Recognition (ASR).
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness (Jun 12, 2024). Tasks: Action Detection, Activity Detection.
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding (Jun 12, 2024). Tasks: Automatic Speech Recognition (ASR).
Refining Self-Supervised Learnt Speech Representation using Brain Activations (Jun 12, 2024). Tasks: Automatic Speech Recognition, Speaker Verification.
Transformer-based Model for ASR N-Best Rescoring and Rewriting (Jun 12, 2024). Tasks: Automatic Speech Recognition (ASR).
Spoken Language Corpora Augmentation with Domain-Specific Voice-Cloned Speech (Jun 11, 2024). Tasks: Speech Recognition.
Reading Miscue Detection in Primary School through Automatic Speech Recognition (Jun 11, 2024). Tasks: Automatic Speech Recognition (ASR).
Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter (Jun 11, 2024). Tasks: Automatic Speech Recognition (ASR).
AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection (Jun 11, 2024). Tasks: Automatic Speech Recognition (ASR).
Tag and correct: high precision post-editing approach to correction of speech recognition errors (Jun 11, 2024). Tasks: Automatic Speech Recognition, Speech Recognition.
Label-Looping: Highly Efficient Decoding for Transducers (Jun 10, 2024). Tasks: GPU, Speech Recognition.
Synthetic Query Generation using Large Language Models for Virtual Assistants (Jun 10, 2024). Tasks: Information Retrieval, Speech Recognition.
ASTRA: Aligning Speech and Text Representations for ASR without Sampling (Jun 10, 2024). Tasks: Automatic Speech Recognition (ASR).
A Parameter-efficient Language Extension Framework for Multilingual ASR (Jun 10, 2024). Tasks: Continual Learning, Parameter-Efficient Fine-Tuning.
MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations (Jun 9, 2024). Tasks: Automatic Speech Recognition (ASR).
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper (Jun 9, 2024). Tasks: Speech Recognition. Code available.
Optimizing Multi-Stuttered Speech Classification: Leveraging Whisper's Encoder for Efficient Parameter Reduction in Automated Assessment (Jun 9, 2024). Tasks: Multi-Label Classification.
Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis (Jun 7, 2024). Tasks: Automatic Speech Recognition (ASR).
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR (Jun 7, 2024). Tasks: Automatic Speech Recognition (ASR).
Hypernetworks for Personalizing ASR to Atypical Speech (Jun 6, 2024). Tasks: Automatic Speech Recognition (ASR).
Helsinki Speech Challenge 2024 (Jun 6, 2024). Tasks: Speech Enhancement, Speech Recognition.
To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation (Jun 6, 2024). Tasks: Automatic Speech Recognition (ASR). Code available.
Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU (Jun 6, 2024). Tasks: GPU, Speech Recognition.
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores (Jun 6, 2024). Tasks: Automatic Speech Recognition (ASR).
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend (Jun 6, 2024). Tasks: Automatic Speech Recognition (ASR).
Text Injection for Neural Contextual Biasing (Jun 5, 2024). Tasks: Automatic Speech Recognition (ASR).
Error-preserving Automatic Speech Recognition of Young English Learners' Language (Jun 5, 2024). Tasks: Automatic Speech Recognition, Language Modeling. Code available.
Joint Beam Search Integrating CTC, Attention, and Transducer Decoders (Jun 5, 2024). Tasks: Automatic Speech Recognition, Decoder.
Enhancing CTC-based speech recognition with diverse modeling units (Jun 5, 2024). Tasks: Automatic Speech Recognition (ASR).
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition (Jun 5, 2024). Tasks: Automatic Speech Recognition (ASR).
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision (Jun 4, 2024). Tasks: Automatic Speech Recognition, Speech Recognition.
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping (Jun 4, 2024). Tasks: Automatic Speech Recognition (ASR).
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing (Jun 4, 2024). Tasks: Decoder, Language Modeling.
Keyword-Guided Adaptation of Automatic Speech Recognition (Jun 4, 2024). Tasks: Automatic Speech Recognition (ASR).
Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach (Jun 3, 2024). Tasks: Automatic Speech Recognition (ASR).
Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization (Jun 3, 2024). Tasks: Image Classification.
YODAS: Youtube-Oriented Dataset for Audio and Speech (Jun 2, 2024). Tasks: Self-Supervised Learning, Speech Recognition.
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning (Jun 1, 2024). Tasks: Automatic Speech Recognition (ASR).
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities (May 29, 2024). Tasks: Automatic Speech Recognition (ASR).
Augmented Conversation with Embedded Speech-Driven On-the-Fly Referencing in AR (May 28, 2024). Tasks: Friction, Speech Recognition.
Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation (May 28, 2024). Tasks: Automatic Speech Recognition (ASR).
NUTS, NARS, and Speech (May 28, 2024). Tasks: Dimensionality Reduction, Speech Recognition.
Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients (May 27, 2024). Tasks: Automatic Speech Recognition, Federated Learning. Code available.
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition (May 24, 2024). Tasks: Automatic Speech Recognition (ASR).
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding (May 23, 2024). Tasks: Automatic Speech Recognition (ASR). Code available.
You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish (May 22, 2024). Tasks: Automatic Speech Recognition (ASR).
ST-Gait++: Leveraging spatio-temporal convolutions for gait-based emotion recognition on videos (May 22, 2024). Tasks: Emotion Classification, Emotion Recognition.
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation (May 22, 2024). Tasks: Automatic Speech Recognition (ASR).
Contextualized Automatic Speech Recognition with Dynamic Vocabulary (May 22, 2024). Tasks: Automatic Speech Recognition, Language Modeling.