wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech Aug 8, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability Aug 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-Supervised Learning for Multi-Channel Neural Transducer Aug 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval Aug 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024 Aug 5, 2024 Decoder speech-recognition
Code Code Available 0StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion Aug 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features Aug 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data Aug 1, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation Aug 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards interfacing large language models with ASR systems using confidence measures and prompting Jul 31, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The Llama 3 Herd of Models Jul 31, 2024 answerability prediction Language Modeling
Code Code Available 4On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition Jul 31, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition Jul 30, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks Jul 28, 2024 Emotion Recognition parameter-efficient fine-tuning
— Unverified 0Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses Jul 26, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation Jul 26, 2024 Contrastive Learning speech-recognition
Code Code Available 1Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing Jul 26, 2024 Attribute Language Modelling
Code Code Available 1Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks Jul 26, 2024 Generative Adversarial Network Speech Enhancement
— Unverified 0On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures Jul 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions Jul 25, 2024 Automatic Speech Recognition Decoder
— Unverified 0Scaling A Simple Approach to Zero-Shot Speech Recognition Jul 25, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Sentiment Reasoning for Healthcare Jul 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3Coupling Speech Encoders with Downstream Text Models Jul 24, 2024 speech-recognition Speech Recognition
— Unverified 0A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives Jul 24, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization Jul 23, 2024 Automatic Speech Recognition Distant Speech Recognition
— Unverified 0Quantifying the Role of Textual Predictability in Automatic Speech Recognition Jul 23, 2024 Attribute Automatic Speech Recognition
— Unverified 0Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction Jul 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Robustness of Speech Separation Models for Similar-pitch Speakers Jul 22, 2024 speech-recognition Speech Recognition
— Unverified 0dMel: Speech Tokenization made Simple Jul 22, 2024 Decoder Language Modeling
Code Code Available 1Trading Devil Final: Backdoor attack via Stock market and Bayesian Optimization Jul 21, 2024 Automatic Speech Recognition Backdoor Attack
— Unverified 0GE2E-AC: Generalized End-to-End Loss Training for Accent Classification Jul 19, 2024 Accented Speech Recognition Classification
— Unverified 0Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance Jul 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Handling Numeric Expressions in Automatic Speech Recognition Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Robust ASR Error Correction with Conservative Data Filtering Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A light-weight and efficient punctuation and word casing prediction model for on-device streaming ASR Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adaptive Cascading Network for Continual Test-Time Adaptation Jul 17, 2024 image-classification Image Classification
Code Code Available 0Morphosyntactic Analysis for CHILDES Jul 17, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation Jul 16, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models Jul 16, 2024 Attribute Speaker Identification
Code Code Available 0Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors Jul 16, 2024 Automatic Phoneme Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Beyond Binary: Multiclass Paraphasia Detection with Generative Pretrained Transformers and End-to-End Models Jul 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks Jul 15, 2024 Action Classification Data Augmentation
Code Code Available 0Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data Jul 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Textless Dependency Parsing by Labeled Sequence Prediction Jul 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation Jul 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text-Based Detection of On-Hold Scripts in Contact Center Calls Jul 13, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System Jul 13, 2024 Decoder speech-recognition
Code Code Available 1Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis Jul 13, 2024 Mamba speech-recognition
Code Code Available 2