Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Streaming Speech-to-Confusion Network Speech Recognition Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Can Contextual Biasing Remain Effective with Whisper and GPT-2? Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Audio-Visual Speech Enhancement with Score-Based Generative Models Jun 2, 2023 Automatic Speech Recognition Lipreading
— Unverified 0DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model Jun 2, 2023 speech-recognition Speech Recognition
Code Code Available 1Tensor decomposition for minimization of E2E SLU model toward on-device processing Jun 2, 2023 speech-recognition Speech Recognition
— Unverified 0Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation Jun 2, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Improved DeepFake Detection Using Whisper Features Jun 2, 2023 Automatic Speech Recognition DeepFake Detection
Code Code Available 1On Crowdsourcing-design with Comparison Category Rating for Evaluating Speech Enhancement Algorithms Jun 2, 2023 Speech Enhancement speech-recognition
— Unverified 0Some voices are too common: Building fair speech recognition systems using the Common Voice dataset Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Robustness of Arabic Speech Dialect Identification Jun 1, 2023 Dialect Identification Self-Supervised Learning
— Unverified 0Adapting an Unadaptable ASR System Jun 1, 2023 speech-recognition Speech Recognition
— Unverified 0Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning Jun 1, 2023 Contrastive Learning speech-recognition
— Unverified 0Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition Jun 1, 2023 Prediction speech-recognition
— Unverified 0Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations Jun 1, 2023 Data Augmentation Domain Adaptation
— Unverified 0Adaptation and Optimization of Automatic Speech Recognition (ASR) for the Maritime Domain in the Field of VHF Communication Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech inpainting: Context-based speech synthesis guided by video Jun 1, 2023 speech-recognition Speech Recognition
— Unverified 0AfriNames: Most ASR models "butcher" African Names Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Encoder-decoder multimodal speaker change detection Jun 1, 2023 Automatic Speech Recognition Change Detection
— Unverified 0SlothSpeech: Denial-of-service Attack Against Speech Recognition Models Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Strategies for improving low resource speech to text translation relying on pre-trained ASR models May 31, 2023 Automatic Speech Recognition Decoder
— Unverified 0The Tag-Team Approach: Leveraging CLS and Language Tagging for Enhancing Multilingual ASR May 31, 2023 speech-recognition Speech Recognition
— Unverified 0Accurate and Structured Pruning for Efficient Automatic Speech Recognition May 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning May 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Zero-Shot Automatic Pronunciation Assessment May 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Perception and Semantic Aware Regularization for Sequential Confidence Calibration May 31, 2023 Language Modelling speech-recognition
Code Code Available 1VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition May 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The News Delivery Channel Recommendation Based on Granular Neural Network May 30, 2023 Collaborative Filtering Deep Learning
— Unverified 0Towards Selection of Text-to-speech Data to Augment ASR Training May 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting Multi-Lingual ASR Models for Handling Multiple Talkers May 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions May 30, 2023 All Automatic Speech Recognition
— Unverified 0Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator May 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Building Accurate Low Latency ASR for Streaming Voice Search May 29, 2023 Action Detection Activity Detection
— Unverified 0Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target May 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation May 29, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition May 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings May 29, 2023 Clustering speaker-diarization
— Unverified 0HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition May 29, 2023 speech-recognition Speech Recognition
— Unverified 0CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice May 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition May 28, 2023 Decoder Sequence-To-Sequence Speech Recognition
— Unverified 0Bridging the Granularity Gap for Acoustic Modeling May 27, 2023 speech-recognition Speech Recognition
Code Code Available 1A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU May 27, 2023 Autonomous Vehicles Deep Learning
— Unverified 0BIG-C: a Multimodal Multi-Purpose Dataset for Bemba May 26, 2023 Machine Translation speech-recognition
Code Code Available 1Robustness of Multi-Source MT to Transcription Errors May 26, 2023 automatic-speech-translation Machine Translation
— Unverified 02-bit Conformer quantization for automatic speech recognition May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0