Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms Mar 16, 2023 Multi-Task Learning Speech Enhancement
— Unverified 0Trustera: A Live Conversation Redaction System Mar 16, 2023 Automatic Speech Recognition Natural Language Understanding
— Unverified 0DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model Mar 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A large-scale multimodal dataset of human speech recognition Mar 15, 2023 Lip Reading Motion Detection
— Unverified 0Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences Mar 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism Mar 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring Mar 15, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models Mar 15, 2023 speech-recognition Speech Recognition
— Unverified 0Improving Accented Speech Recognition with Multi-Domain Training Mar 14, 2023 Accented Speech Recognition Automatic Speech Recognition
— Unverified 0Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy Mar 14, 2023 Position Sentence
— Unverified 0I3D: Transformer architectures with input-dependent dynamic depth for speech recognition Mar 14, 2023 Model Compression speech-recognition
Code Code Available 0Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model Mar 13, 2023 Decision Making Scene Text Recognition
— Unverified 0Improving the Intent Classification accuracy in Noisy Environment Mar 12, 2023 Automatic Speech Recognition Classification
— Unverified 0Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study Mar 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge Mar 11, 2023 Audio-Visual Speech Recognition speech-recognition
— Unverified 0Stabilizing Transformer Training by Preventing Attention Entropy Collapse Mar 11, 2023 Automatic Speech Recognition image-classification
Code Code Available 2Transcription free filler word detection with Neural semi-CRFs Mar 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0MIXPGD: Hybrid Adversarial Training for Speech Recognition Systems Mar 10, 2023 Adversarial Attack Automatic Speech Recognition
— Unverified 0Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings Mar 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Overview on Language Models: Recent Developments and Outlook Mar 10, 2023 Language Modeling Language Modelling
— Unverified 0Unsupervised Language agnostic WER Standardization Mar 9, 2023 speech-recognition Speech Recognition
— Unverified 0DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks Mar 8, 2023 Fault Detection speech-recognition
Code Code Available 0TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings Mar 7, 2023 Action Detection Activity Detection
Code Code Available 1wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts Mar 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Calibrating Transformers via Sparse Gaussian Processes Mar 4, 2023 Bayesian Inference Gaussian Processes
Code Code Available 1End-to-End Speech Recognition: A Survey Mar 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis Mar 3, 2023 Emotion Recognition Knowledge Distillation
— Unverified 0SottoVoce: An Ultrasound Imaging-Based Silent Speech Interaction Using Deep Neural Networks Mar 3, 2023 speech-recognition Speech Recognition
— Unverified 0Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages Mar 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Large Text Corpora for End-to-End Speech Summarization Mar 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LiteG2P: A fast, light and high accuracy model for grapheme-to-phoneme conversion Mar 2, 2023 Grapheme-to-Phoneme Conversion speech-recognition
— Unverified 0Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition Mar 1, 2023 Automatic Speech Recognition Data Augmentation
— Unverified 0N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space Mar 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation Mar 1, 2023 Audio-Visual Speech Recognition Robust Speech Recognition
Code Code Available 2Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition Mar 1, 2023 Acoustic echo cancellation Automatic Speech Recognition
— Unverified 0Practice of the conformer enhanced AUDIO-VISUAL HUBERT on Mandarin and English Feb 28, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition Feb 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition Feb 28, 2023 speech-recognition Speech Recognition
— Unverified 0A Token-Wise Beam Search Algorithm for RNN-T Feb 28, 2023 speech-recognition Speech Recognition
— Unverified 0BrainBERT: Self-supervised representation learning for intracranial recordings Feb 28, 2023 Language Modeling Language Modelling
Code Code Available 1Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Diagonal State Space Augmented Transformers for Speech Recognition Feb 27, 2023 speech-recognition Speech Recognition
— Unverified 0Diacritic Recognition Performance in Arabic ASR Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Explanations for Automatic Speech Recognition Feb 27, 2023 Automatic Speech Recognition Explainable Artificial Intelligence (XAI)
— Unverified 0Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding Feb 27, 2023 Model Compression Representation Learning
Code Code Available 1MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multimodal Speech Recognition for Language-Guided Embodied Agents Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0