Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition Aug 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition Aug 17, 2024 Language Modeling Language Modelling
Code Code Available 0Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words Aug 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement Aug 14, 2024 Automatic Speech Recognition Speech Enhancement
— Unverified 0Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation Aug 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance Aug 12, 2024 Acoustic Scene Classification Automatic Speech Recognition
— Unverified 0Cross-Lingual Conversational Speech Summarization with Large Language Models Aug 12, 2024 Machine Translation speech-recognition
— Unverified 0Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning Aug 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing Aug 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text Aug 10, 2024 Automatic Speech Recognition Hallucination
— Unverified 0HydraFormer: One Encoder For All Subsampling Rates Aug 8, 2024 All Automatic Speech Recognition
Code Code Available 0Preserving spoken content in voice anonymisation with character-level vocoder conditioning Aug 8, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability Aug 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-Supervised Learning for Multi-Channel Neural Transducer Aug 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval Aug 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion Aug 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024 Aug 5, 2024 Decoder speech-recognition
Code Code Available 0SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data Aug 1, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation Aug 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition Jul 31, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards interfacing large language models with ASR systems using confidence measures and prompting Jul 31, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition Jul 30, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks Jul 28, 2024 Emotion Recognition parameter-efficient fine-tuning
— Unverified 0Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses Jul 26, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks Jul 26, 2024 Generative Adversarial Network Speech Enhancement
— Unverified 0Scaling A Simple Approach to Zero-Shot Speech Recognition Jul 25, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions Jul 25, 2024 Automatic Speech Recognition Decoder
— Unverified 0On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures Jul 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Coupling Speech Encoders with Downstream Text Models Jul 24, 2024 speech-recognition Speech Recognition
— Unverified 0A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives Jul 24, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization Jul 23, 2024 Automatic Speech Recognition Distant Speech Recognition
— Unverified 0Quantifying the Role of Textual Predictability in Automatic Speech Recognition Jul 23, 2024 Attribute Automatic Speech Recognition
— Unverified 0Robustness of Speech Separation Models for Similar-pitch Speakers Jul 22, 2024 speech-recognition Speech Recognition
— Unverified 0Trading Devil Final: Backdoor attack via Stock market and Bayesian Optimization Jul 21, 2024 Automatic Speech Recognition Backdoor Attack
— Unverified 0Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance Jul 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0GE2E-AC: Generalized End-to-End Loss Training for Accent Classification Jul 19, 2024 Accented Speech Recognition Classification
— Unverified 0Handling Numeric Expressions in Automatic Speech Recognition Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A light-weight and efficient punctuation and word casing prediction model for on-device streaming ASR Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Robust ASR Error Correction with Conservative Data Filtering Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adaptive Cascading Network for Continual Test-Time Adaptation Jul 17, 2024 image-classification Image Classification
Code Code Available 0Morphosyntactic Analysis for CHILDES Jul 17, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models Jul 16, 2024 Attribute Speaker Identification
Code Code Available 0The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation Jul 16, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Beyond Binary: Multiclass Paraphasia Detection with Generative Pretrained Transformers and End-to-End Models Jul 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks Jul 15, 2024 Action Classification Data Augmentation
Code Code Available 0Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data Jul 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Textless Dependency Parsing by Labeled Sequence Prediction Jul 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation Jul 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CUSIDE-array: A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic Evaluations Jul 13, 2024 Chunking speech-recognition
— Unverified 0