Confidence-based Ensembles of End-to-End Speech Recognition Models Jun 27, 2023 Language Identification Model Selection
— Unverified 0Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems Jun 26, 2023 Diversity speech-recognition
— Unverified 0Meta-Gating Framework for Fast and Continuous Resource Optimization in Dynamic Wireless Environments Jun 23, 2023 image-classification Image Classification
— Unverified 0Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems Jun 23, 2023 speech-recognition Speech Recognition
— Unverified 0Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning Jun 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios Jun 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AudioPaLM: A Large Language Model That Can Speak and Listen Jun 22, 2023 Language Modeling Language Modelling
— Unverified 0Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection Jun 21, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Mixture Encoder for Joint Speech Separation and Recognition Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learning When to Trust Which Teacher for Weakly Supervised ASR Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Federated Self-Learning with Weak Supervision for Speech Recognition Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring the Role of Audio in Video Captioning Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation Jun 20, 2023 Cross-corpus Sentence
Code Code Available 0Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition Jun 20, 2023 Accented Speech Recognition speech-recognition
— Unverified 0Rehearsal-Free Online Continual Learning for Automatic Speech Recognition Jun 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition Jun 18, 2023 Decoder Domain Adaptation
Code Code Available 0Distillation Strategies for Discriminative Speech Recognition Rescoring Jun 15, 2023 Language Modeling Language Modelling
— Unverified 0MobileASR: A resource-aware on-device learning framework for user voice personalization applications on mobile phones Jun 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction Jun 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer Jun 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation Jun 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0EM-Network: Oracle Guided Self-distillation for Sequence Learning Jun 14, 2023 Decoder Machine Translation
— Unverified 0Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey Jun 14, 2023 speech-recognition Speech Recognition
— Unverified 0Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition Jun 14, 2023 Data Augmentation speech-recognition
— Unverified 0Feature Normalization for Fine-tuning Self-Supervised Models in Speech Enhancement Jun 14, 2023 Self-Supervised Learning Speech Enhancement
— Unverified 0Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure Jun 14, 2023 Domain Adaptation speech-recognition
— Unverified 0DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer ASR Jun 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Large-scale Language Model Rescoring on Long-form Data Jun 13, 2023 Form Language Modeling
— Unverified 0Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition Jun 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition Jun 12, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation Jun 12, 2023 Diversity speech-recognition
— Unverified 0On the N-gram Approximation of Pre-trained Language Models Jun 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multimodal Audio-textual Architecture for Robust Spoken Language Understanding Jun 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Impact of Experiencing Misrecognition by Teachable Agents on Learning and Rapport Jun 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model Jun 10, 2023 Automatic Speech Recognition Prosody Prediction
— Unverified 0Modality Influence in Multimodal Machine Learning Jun 10, 2023 Decision Making Emotion Recognition
— Unverified 0Adversarial Training For Low-Resource Disfluency Correction Jun 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Theory of Unsupervised Speech Recognition Jun 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition Jun 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Record Deduplication for Entity Distribution Modeling in ASR Transcripts Jun 9, 2023 Entity Resolution speech-recognition
— Unverified 0Improving Language Model Integration for Neural Machine Translation Jun 8, 2023 Automatic Speech Recognition Language Modeling
— Unverified 0Latent Phrase Matching for Dysarthric Speech Jun 8, 2023 speech-recognition Speech Recognition
— Unverified 0Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency Jun 7, 2023 Machine Translation speech-recognition
— Unverified 0Label Aware Speech Representation Learning For Language Identification Jun 7, 2023 Language Identification Missing Labels
— Unverified 0An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders Jun 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator Jun 7, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak Jun 7, 2023 speech-recognition Speech Recognition
— Unverified 0Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer Jun 7, 2023 Domain Adaptation Language Modeling
— Unverified 0A study on the impact of Self-Supervised Learning on automatic dysarthric speech assessment Jun 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation Jun 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0