Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Chain-of-Thought Prompting for Speech Translation Sep 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bio-Inspired Mamba: Temporal Locality and Bioplausible Learning in Selective State Space Models Sep 17, 2024 Language Modeling Language Modelling
— Unverified 0Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models Sep 17, 2024 Audio captioning Instruction Following
— Unverified 0SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models Sep 16, 2024 Automatic Speech Recognition Prompt Engineering
— Unverified 0An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Augmenting Automatic Speech Recognition Models with Disfluency Detection Sep 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition Sep 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASR Error Correction using Large Language Models Sep 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Clean Label Attacks against SLU Systems Sep 13, 2024 Data Poisoning speech-recognition
— Unverified 0Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? Sep 13, 2024 Automatic Speech Recognition Decoder
— Unverified 0Exploring SSL Discrete Tokens for Multilingual ASR Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training Sep 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models Sep 12, 2024 Adversarial Attack Adversarial Purification
Code Code Available 0Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction Sep 12, 2024 Depression Detection speech-recognition
— Unverified 0The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language Sep 12, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Full-text Error Correction for Chinese Speech Recognition with Large Language Model Sep 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WhisperNER: Unified Open Named Entity and Speech Recognition Sep 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3Faster Speech-LLaMA Inference with Multi-token Prediction Sep 12, 2024 Decoder Prediction
— Unverified 0Contextualization of ASR with LLM using phonetic retrieval-based augmentation Sep 11, 2024 Retrieval speech-recognition
— Unverified 0Rethinking Mamba in Speech Processing by Self-Supervised Models Sep 11, 2024 Mamba Speech Enhancement
— Unverified 0Enhancing CTC-Based Visual Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0How Redundant Is the Transformer Stack in Speech Representation Models? Sep 10, 2024 Knowledge Distillation Speaker Identification
— Unverified 0An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition Sep 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings Sep 10, 2024 Automatic Speech Recognition Diversity
Code Code Available 0Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking Sep 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge Sep 9, 2024 Action Detection Activity Detection
— Unverified 0An investigation of modularity for noise robustness in conformer-based ASR Sep 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR Sep 9, 2024 Automatic Speech Recognition speaker-diarization
— Unverified 0Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge Sep 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Retrieval Augmented Correction of Named Entity Speech Recognition Errors Sep 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation Sep 9, 2024 speech-recognition Speech Recognition
— Unverified 0Consensus-based Distributed Quantum Kernel Learning for Speech Recognition Sep 9, 2024 Computational Efficiency Emotion Recognition
— Unverified 0Evaluation of real-time transcriptions using end-to-end ASR models Sep 9, 2024 Action Detection Activity Detection
— Unverified 0Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection Sep 8, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lightweight Transducer Based on Frame-Level Criterion Sep 5, 2024 Decoder imbalanced classification
Code Code Available 0Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models Sep 4, 2024 Decoder Noisy Speech Recognition
— Unverified 0Quantification of stylistic differences in human- and ASR-produced transcripts of African American English Sep 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Probing self-attention in self-supervised speech models for cross-linguistic differences Sep 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations Sep 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge Sep 3, 2024 speech-recognition Speech Recognition
— Unverified 0Enhancing Code-Switching Speech Recognition with LID-Based Collaborative Mixture of Experts Model Sep 3, 2024 Language Identification Mixture-of-Experts
— Unverified 0Reassessing Noise Augmentation Methods in the Context of Adversarial Speech Sep 3, 2024 Adversarial Robustness Automatic Speech Recognition
— Unverified 0