DiarizationLM: Speaker Diarization Post-Processing with Large Language Models Jan 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition Jan 7, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge Jan 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR Jan 6, 2024 Active Learning Automatic Speech Recognition
Code Code Available 0Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition Jan 4, 2024 Attribute Automatic Speech Recognition
Code Code Available 0Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models Jan 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Probing Contact Center Large Language Models Dec 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring data augmentation in bias mitigation against non-native-accented speech Dec 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0BLSTM-Based Confidence Estimation for End-to-End Speech Recognition Dec 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification Dec 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models Dec 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition Dec 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0SpokesBiz -- an Open Corpus of Conversational Polish Dec 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0OAVA: the open audio-visual archives aggregator Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Conformer-Based Speech Recognition On Extreme Edge-Computing Devices Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Seq2seq for Automatic Paraphasia Detection in Aphasic Speech Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data Dec 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FastInject: Injecting Unpaired Text Data into CTC-based ASR training Dec 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Extending Whisper with prompt tuning to target-speaker ASR Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Creating Spoken Dialog Systems in Ultra-Low Resourced Settings Dec 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning Dec 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition Dec 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training Dec 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-End Speech-to-Text Translation: A Survey Dec 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data Nov 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors Nov 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0How does end-to-end speech recognition training impact speech enhancement artifacts? Nov 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition Nov 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding Nov 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer Nov 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-channel Conversational Speaker Separation via Neural Diarization Nov 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models Nov 14, 2023 Acoustic Scene Classification Audio captioning
Code Code Available 3Retrieve and Copy: Scaling ASR Personalization to Large Catalogs Nov 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition Nov 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Nov 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 11SPU: 1-step Speech Processing Unit Nov 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition Nov 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning Nov 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition Nov 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning Nov 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Nov 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants Nov 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios Oct 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics Oct 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0CL-MASR: A Continual Learning Benchmark for Multilingual ASR Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ArTST: Arabic Text and Speech Transformer Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1