Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy Mar 14, 2023 Position Sentence
— Unverified 0Improving Accented Speech Recognition with Multi-Domain Training Mar 14, 2023 Accented Speech Recognition Automatic Speech Recognition
— Unverified 0I3D: Transformer architectures with input-dependent dynamic depth for speech recognition Mar 14, 2023 Model Compression speech-recognition
Code Code Available 0Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model Mar 13, 2023 Decision Making Scene Text Recognition
— Unverified 0Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study Mar 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improving the Intent Classification accuracy in Noisy Environment Mar 12, 2023 Automatic Speech Recognition Classification
— Unverified 0Transcription free filler word detection with Neural semi-CRFs Mar 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge Mar 11, 2023 Audio-Visual Speech Recognition speech-recognition
— Unverified 0MIXPGD: Hybrid Adversarial Training for Speech Recognition Systems Mar 10, 2023 Adversarial Attack Automatic Speech Recognition
— Unverified 0Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings Mar 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Overview on Language Models: Recent Developments and Outlook Mar 10, 2023 Language Modeling Language Modelling
— Unverified 0Unsupervised Language agnostic WER Standardization Mar 9, 2023 speech-recognition Speech Recognition
— Unverified 0DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks Mar 8, 2023 Fault Detection speech-recognition
Code Code Available 0wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts Mar 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SottoVoce: An Ultrasound Imaging-Based Silent Speech Interaction Using Deep Neural Networks Mar 3, 2023 speech-recognition Speech Recognition
— Unverified 0Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis Mar 3, 2023 Emotion Recognition Knowledge Distillation
— Unverified 0End-to-End Speech Recognition: A Survey Mar 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LiteG2P: A fast, light and high accuracy model for grapheme-to-phoneme conversion Mar 2, 2023 Grapheme-to-Phoneme Conversion speech-recognition
— Unverified 0Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages Mar 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Large Text Corpora for End-to-End Speech Summarization Mar 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition Mar 1, 2023 Automatic Speech Recognition Data Augmentation
— Unverified 0N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space Mar 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition Mar 1, 2023 Acoustic echo cancellation Automatic Speech Recognition
— Unverified 0Practice of the conformer enhanced AUDIO-VISUAL HUBERT on Mandarin and English Feb 28, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition Feb 28, 2023 speech-recognition Speech Recognition
— Unverified 0Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition Feb 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Token-Wise Beam Search Algorithm for RNN-T Feb 28, 2023 speech-recognition Speech Recognition
— Unverified 0A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Diagonal State Space Augmented Transformers for Speech Recognition Feb 27, 2023 speech-recognition Speech Recognition
— Unverified 0Diacritic Recognition Performance in Arabic ASR Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A low latency attention module for streaming self-supervised speech representation learning Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Explanations for Automatic Speech Recognition Feb 27, 2023 Automatic Speech Recognition Explainable Artificial Intelligence (XAI)
— Unverified 0Multimodal Speech Recognition for Language-Guided Embodied Agents Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Ensemble for Multimodal Punctuation Restoration using Time-Delay Neural Network Feb 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech Corpora Divergence Based Unsupervised Data Selection for ASR Feb 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0From Audio to Symbolic Encoding Feb 26, 2023 Information Retrieval Music Information Retrieval
— Unverified 0Chaotic Variational Auto encoder-based Adversarial Machine Learning Feb 25, 2023 speech-recognition Speech Recognition
— Unverified 0Ensemble knowledge distillation of self-supervised speech models Feb 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pre-Finetuning for Few-Shot Emotional Speech Recognition Feb 24, 2023 Few-Shot Learning speech-recognition
Code Code Available 0Improving Massively Multilingual ASR With Auxiliary CTC Objectives Feb 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Factual Consistency Oriented Speech Recognition Feb 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Generalization of Auto-Regressive Hidden Markov Models to Non-Linear Dynamics and Unit Quaternion Observation Space Feb 23, 2023 speech-recognition Speech Recognition
— Unverified 0Evaluating Automatic Speech Recognition in an Incremental Setting Feb 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0UML: A Universal Monolingual Output Layer for Multilingual ASR Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0