Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction Sep 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition Sep 21, 2024 Audio Deepfake Detection DeepFake Detection
— Unverified 0MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder Sep 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection Sep 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering Sep 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR Sep 20, 2024 ARC Automatic Speech Recognition
— Unverified 0Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper Sep 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Should Understand Pinyin for Chinese ASR Error Correction Sep 20, 2024 Automatic Speech Recognition Language Modeling
— Unverified 0Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space Sep 19, 2024 Automatic Speech Recognition Data Augmentation
— Unverified 0Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition Sep 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Personalized Speech Recognition for Children with Test-Time Adaptation Sep 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Robust Audiovisual Speech Recognition Models with Mixture-of-Experts Sep 19, 2024 Mixture-of-Experts Robust Speech Recognition
— Unverified 0ASR Benchmarking: Need for a More Representative Conversational Dataset Sep 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR Sep 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WER We Stand: Benchmarking Urdu ASR Models Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Chain-of-Thought Prompting for Speech Translation Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework Sep 17, 2024 Phoneme Recognition speech-recognition
— Unverified 0M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses Sep 17, 2024 Action Detection Activity Detection
— Unverified 0Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bio-Inspired Mamba: Temporal Locality and Bioplausible Learning in Selective State Space Models Sep 17, 2024 Language Modeling Language Modelling
— Unverified 0Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models Sep 17, 2024 Audio captioning Instruction Following
— Unverified 0An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Augmenting Automatic Speech Recognition Models with Disfluency Detection Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models Sep 16, 2024 Automatic Speech Recognition Prompt Engineering
— Unverified 0SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition Sep 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASR Error Correction using Large Language Models Sep 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring SSL Discrete Tokens for Multilingual ASR Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? Sep 13, 2024 Automatic Speech Recognition Decoder
— Unverified 0Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Clean Label Attacks against SLU Systems Sep 13, 2024 Data Poisoning speech-recognition
— Unverified 0Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction Sep 12, 2024 Depression Detection speech-recognition
— Unverified 0Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models Sep 12, 2024 Adversarial Attack Adversarial Purification
Code Code Available 0Full-text Error Correction for Chinese Speech Recognition with Large Language Model Sep 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language Sep 12, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Faster Speech-LLaMA Inference with Multi-token Prediction Sep 12, 2024 Decoder Prediction
— Unverified 0Contextualization of ASR with LLM using phonetic retrieval-based augmentation Sep 11, 2024 Retrieval speech-recognition
— Unverified 0Enhancing CTC-Based Visual Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Rethinking Mamba in Speech Processing by Self-Supervised Models Sep 11, 2024 Mamba Speech Enhancement
— Unverified 0How Redundant Is the Transformer Stack in Speech Representation Models? Sep 10, 2024 Knowledge Distillation Speaker Identification
— Unverified 0An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition Sep 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking Sep 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings Sep 10, 2024 Automatic Speech Recognition Diversity
Code Code Available 0