INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Mixture-of-Expert Conformer for Streaming Multilingual ASR May 25, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification May 25, 2023 Classification speech-recognition
— Unverified 0Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data May 25, 2023 Knowledge Distillation Speech Extraction
— Unverified 0VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation May 25, 2023 Decoder Language Modeling
— Unverified 0ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Svarah: Evaluating English ASR Systems on Indian Accents May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Scheduled Sampling for Neural Transducer-based ASR May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition May 25, 2023 Denoising Self-Supervised Learning
— Unverified 0InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models May 24, 2023 Machine Translation Model Compression
— Unverified 0Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM May 24, 2023 Language Modelling Question Answering
Code Code Available 0Iteratively Improving Speech Recognition and Voice Conversion May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Textless Speech-to-Speech Translation With Limited Parallel Data May 24, 2023 Automatic Speech Recognition Denoising
Code Code Available 0On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalized Predictive ASR for Latency Reduction in Voice Assistants May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SE-Bridge: Speech Enhancement with Consistent Brownian Bridge May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers May 23, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding May 23, 2023 Decoder speech-recognition
— Unverified 0TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning May 23, 2023 Metric Learning speech-recognition
— Unverified 0Text Generation with Speech Synthesis for ASR Data Augmentation May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Modular Domain Adaptation for Conformer-Based Streaming ASR May 22, 2023 Domain Adaptation speech-recognition
— Unverified 0Scaling Speech Technology to 1,000+ Languages May 22, 2023 Automatic Speech Recognition Language Identification
Code Code Available 1Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition May 22, 2023 Sentence speech-recognition
— Unverified 0Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test May 22, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1GNCformer Enhanced Self-attention for Automatic Speech Recognition May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction May 21, 2023 Action Detection Activity Detection
— Unverified 0VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting May 21, 2023 Denoising Keyword Spotting
— Unverified 0Hystoc: Obtaining word confidences for fusion of end-to-end ASR systems May 21, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network May 21, 2023 speech-recognition Speech Recognition
— Unverified 0CASA-ASR: Context-Aware Speaker-Attributed ASR May 21, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-Head State Space Model for Speech Recognition May 21, 2023 Language Modeling Language Modelling
— Unverified 0Self-supervised representations in speech-based depression detection May 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised ASR via Cross-Lingual Pseudo-Labeling May 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning May 19, 2023 Multi-Task Learning speech-recognition
— Unverified 0BAT: Boundary aware transducer for memory-efficient and low-latency ASR May 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition May 19, 2023 Diversity Self-Supervised Learning
— Unverified 0Language-universal phonetic encoder for low-resource speech recognition May 19, 2023 Decoder speech-recognition
— Unverified 0Blank-regularized CTC for Frame Skipping in Neural Transducer May 19, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks May 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering May 18, 2023 Acoustic Unit Discovery Clustering
Code Code Available 1