RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition May 28, 2023 Decoder Sequence-To-Sequence Speech Recognition
— Unverified 0A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU May 27, 2023 Autonomous Vehicles Deep Learning
— Unverified 0DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Robustness of Multi-Source MT to Transcription Errors May 26, 2023 automatic-speech-translation Machine Translation
— Unverified 02-bit Conformer quantization for automatic speech recognition May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification May 25, 2023 Classification speech-recognition
— Unverified 0Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition May 25, 2023 Denoising Self-Supervised Learning
— Unverified 0Mixture-of-Expert Conformer for Streaming Multilingual ASR May 25, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data May 25, 2023 Knowledge Distillation Speech Extraction
— Unverified 0ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Scheduled Sampling for Neural Transducer-based ASR May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation May 25, 2023 Decoder Language Modeling
— Unverified 0INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Svarah: Evaluating English ASR Systems on Indian Accents May 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM May 24, 2023 Language Modelling Question Answering
Code Code Available 0InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Textless Speech-to-Speech Translation With Limited Parallel Data May 24, 2023 Automatic Speech Recognition Denoising
Code Code Available 0Iteratively Improving Speech Recognition and Voice Conversion May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models May 24, 2023 Machine Translation Model Compression
— Unverified 0Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding May 23, 2023 Decoder speech-recognition
— Unverified 0Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning May 23, 2023 Metric Learning speech-recognition
— Unverified 0Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalized Predictive ASR for Latency Reduction in Voice Assistants May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers May 23, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SE-Bridge: Speech Enhancement with Consistent Brownian Bridge May 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text Generation with Speech Synthesis for ASR Data Augmentation May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition May 22, 2023 Sentence speech-recognition
— Unverified 0Modular Domain Adaptation for Conformer-Based Streaming ASR May 22, 2023 Domain Adaptation speech-recognition
— Unverified 0GNCformer Enhanced Self-attention for Automatic Speech Recognition May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test May 22, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-Head State Space Model for Speech Recognition May 21, 2023 Language Modeling Language Modelling
— Unverified 0DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting May 21, 2023 Denoising Keyword Spotting
— Unverified 0On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network May 21, 2023 speech-recognition Speech Recognition
— Unverified 0Hystoc: Obtaining word confidences for fusion of end-to-end ASR systems May 21, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0CASA-ASR: Context-Aware Speaker-Attributed ASR May 21, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction May 21, 2023 Action Detection Activity Detection
— Unverified 0Self-supervised representations in speech-based depression detection May 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Language-universal phonetic encoder for low-resource speech recognition May 19, 2023 Decoder speech-recognition
— Unverified 0BAT: Boundary aware transducer for memory-efficient and low-latency ASR May 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised ASR via Cross-Lingual Pseudo-Labeling May 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Blank-regularized CTC for Frame Skipping in Neural Transducer May 19, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0