DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 02-bit Conformer quantization for automatic speech recognition May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Scheduled Sampling for Neural Transducer-based ASR May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Svarah: Evaluating English ASR Systems on Indian Accents May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Iteratively Improving Speech Recognition and Voice Conversion May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalized Predictive ASR for Latency Reduction in Voice Assistants May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SE-Bridge: Speech Enhancement with Consistent Brownian Bridge May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text Generation with Speech Synthesis for ASR Data Augmentation May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0GNCformer Enhanced Self-attention for Automatic Speech Recognition May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction May 21, 2023 Action Detection Activity Detection
— Unverified 0VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-supervised representations in speech-based depression detection May 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0BAT: Boundary aware transducer for memory-efficient and low-latency ASR May 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised ASR via Cross-Lingual Pseudo-Labeling May 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks May 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Lexical-aware Non-autoregressive Transformer-based ASR Model May 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion May 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Critical Appraisal of Artificial Intelligence-Mediated Communication May 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking May 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations May 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes May 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Masked Audio Text Encoders are Effective Multi-Modal Rescorers May 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Quran Recitation Recognition using End-to-End Deep Learning May 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models May 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Who Needs Decoders? Efficient Estimation of Sequence-level Attributes May 9, 2023 Attribute Automatic Speech Recognition
— Unverified 0Multi-Temporal Lip-Audio Memory for Visual Speech Recognition May 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers May 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Employing Hybrid Deep Neural Networks on Dari Speech May 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders May 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks May 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge May 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale Apr 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR Apr 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization Apr 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Understanding Shared Speech-Text Representations Apr 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition Apr 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding Apr 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0OLISIA: a Cascade System for Spoken Dialogue State Tracking Apr 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Towards the Universal Defense for Query-Based Audio Adversarial Attacks Apr 20, 2023 Audio Fingerprint Automatic Speech Recognition
— Unverified 0