Decoder-only Architecture for Streaming End-to-end Speech Recognition Jun 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment Jun 22, 2024 speech-recognition Speech Recognition
— Unverified 0Perception of Phonological Assimilation by Neural Speech Recognition Models Jun 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PI-Whisper: Designing an Adaptive and Incremental Automatic Speech Recognition System for Edge Devices Jun 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0InterBiasing: Boost Unseen Word Recognition through Biasing Intermediate Predictions Jun 21, 2024 speech-recognition Speech Recognition
— Unverified 0An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks Jun 20, 2024 Automatic Speech Recognition Decoder
— Unverified 0Intelligent Interface: Enhancing Lecture Engagement with Didactic Activity Summaries Jun 20, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0DASB -- Discrete Audio and Speech Benchmark Jun 20, 2024 Benchmarking Emotion Recognition
— Unverified 0Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control Jun 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ManWav: The First Manchu ASR Model Jun 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Children's Speech Recognition through Discrete Token Enhancement Jun 19, 2024 speech-recognition Speech Recognition
— Unverified 0Transcribe, Align and Segment: Creating speech datasets for low-resource languages Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting Jun 18, 2024 Decoder Language Identification
— Unverified 0Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Performant ASR Models for Medical Entities in Accented Speech Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Online Continual Learning for Automatic Speech Recognition Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Self-Train Before You Transcribe Jun 17, 2024 Domain Adaptation Language Modelling
Code Code Available 0Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition Jun 16, 2024 Automatic Speech Recognition Data Poisoning
— Unverified 0Optimized Speculative Sampling for GPU Hardware Accelerators Jun 16, 2024 Automatic Speech Recognition GPU
Code Code Available 0Automatic Speech Recognition for Biomedical Data in Bengali Language Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large Language Models for Dysfluency Detection in Stuttered Speech Jun 16, 2024 Automatic Speech Recognition Language Modeling
— Unverified 0CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Emotion Recognition Using CNN and Its Use Case in Digital Healthcare Jun 15, 2024 Emotion Recognition Speech Emotion Recognition
— Unverified 0Trading Devil: Robust backdoor attack via Stochastic investment models and Bayesian approach Jun 15, 2024 Backdoor Attack speech-recognition
— Unverified 0An efficient text augmentation approach for contextualized Mandarin speech recognition Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learning Language Structures through Grounding Jun 14, 2024 Automatic Speech Recognition Dependency Parsing
— Unverified 0Optimizing Byte-level Representation for End-to-end ASR Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Evaluation of Speech Foundation Models for Spoken Language Understanding Jun 14, 2024 Benchmarking Prediction
— Unverified 0Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge Jun 14, 2024 speech-recognition Speech Recognition
— Unverified 0Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition Jun 14, 2024 speech-recognition Speech Recognition
— Unverified 0ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AdaPTwin: Low-Cost Adaptive Compression of Product Twins in Transformers Jun 13, 2024 speech-recognition Speech Recognition
— Unverified 0Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech ReaLLM -- Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time Jun 13, 2024 Decoder speech-recognition
— Unverified 0Multi-Modal Retrieval For Large Language Model Based Speech Recognition Jun 13, 2024 Automatic Speech Recognition Language Modeling
— Unverified 0The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech Jun 13, 2024 Language Identification speaker-diarization
— Unverified 0Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Neural Blind Source Separation and Diarization for Distant Speech Recognition Jun 12, 2024 blind source separation Distant Speech Recognition
— Unverified 0Towards Unsupervised Speech Recognition Without Pronunciation Models Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Transformer-based Model for ASR N-Best Rescoring and Rewriting Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models Jun 12, 2024 Language Modeling Language Modelling
— Unverified 0