Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports | Jun 14, 2023 | Decoder, speech-recognition | Code Available 1
Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition | Jun 14, 2023 | Data Augmentation, speech-recognition | Unverified 0
Feature Normalization for Fine-tuning Self-Supervised Models in Speech Enhancement | Jun 14, 2023 | Self-Supervised Learning, Speech Enhancement | Unverified 0
Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure | Jun 14, 2023 | Domain Adaptation, speech-recognition | Unverified 0
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation | Jun 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
ITALIC: An Italian Intent Classification Dataset | Jun 14, 2023 | Classification, intent-classification | Code Available 1
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer | Jun 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey | Jun 14, 2023 | speech-recognition, Speech Recognition | Unverified 0
Large-scale Language Model Rescoring on Long-form Data | Jun 13, 2023 | Form, Language Modeling | Unverified 0
Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition | Jun 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer ASR | Jun 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages | Jun 13, 2023 | Contrastive Learning, speech-recognition | Code Available 1
Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation | Jun 12, 2023 | Diversity, speech-recognition | Unverified 0
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding | Jun 12, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition | Jun 12, 2023 | Automatic Speech Recognition, speech-recognition | Unverified 0
On the N-gram Approximation of Pre-trained Language Models | Jun 12, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Impact of Experiencing Misrecognition by Teachable Agents on Learning and Rapport | Jun 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment | Jun 10, 2023 | Audio-Visual Speech Recognition, Lip Reading | Code Available 1
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model | Jun 10, 2023 | Automatic Speech Recognition, Prosody Prediction | Unverified 0
Adversarial Training For Low-Resource Disfluency Correction | Jun 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 0
Modality Influence in Multimodal Machine Learning | Jun 10, 2023 | Decision Making, Emotion Recognition | Unverified 0
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition | Jun 9, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
A Theory of Unsupervised Speech Recognition | Jun 9, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 0
Record Deduplication for Entity Distribution Modeling in ASR Transcripts | Jun 9, 2023 | Entity Resolution, speech-recognition | Unverified 0
Latent Phrase Matching for Dysarthric Speech | Jun 8, 2023 | speech-recognition, Speech Recognition | Unverified 0
Improving Language Model Integration for Neural Machine Translation | Jun 8, 2023 | Automatic Speech Recognition, Language Modeling | Unverified 0
FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator | Jun 7, 2023 | Automatic Speech Recognition, speech-recognition | Unverified 0
Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency | Jun 7, 2023 | Machine Translation, speech-recognition | Unverified 0
An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders | Jun 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak | Jun 7, 2023 | speech-recognition, Speech Recognition | Unverified 0
Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages | Jun 7, 2023 | Cross-Lingual Transfer, speech-recognition | Code Available 1
Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Jun 7, 2023 | Attribute, Cross-Lingual Transfer | Code Available 1
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer | Jun 7, 2023 | Domain Adaptation, Language Modeling | Unverified 0
A study on the impact of Self-Supervised Learning on automatic dysarthric speech assessment | Jun 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Label Aware Speech Representation Learning For Language Identification | Jun 7, 2023 | Language Identification, Missing Labels | Unverified 0
Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation | Jun 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 0
Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization | Jun 7, 2023 | Automatic Speech Recognition, Decoder | Unverified 0
Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering | Jun 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain | Jun 6, 2023 | Decision Making, Robust Speech Recognition | Unverified 0
Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics | Jun 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses | Jun 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Machine Unlearning: A Survey | Jun 6, 2023 | Machine Unlearning, Medical Diagnosis | Unverified 0
N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition | Jun 5, 2023 | Arabic Speech Recognition, Benchmarking | Unverified 0
Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition | Jun 5, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition | Jun 5, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information | Jun 4, 2023 | Audio-Visual Speech Recognition, speech-recognition | Code Available 1
End-to-End Joint Target and Non-Target Speakers ASR | Jun 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings | Jun 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Advancing African-Accented Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models | Jun 3, 2023 | Accented Speech Recognition, Active Learning | Code Available 0
SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization | Jun 3, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 1