Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CLSRIL-23: Cross Lingual Speech Representations for Indic Languages Jul 15, 2021 Self-Supervised Learning speech-recognition
Code Code Available 15 A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages Jun 13, 2023 Contrastive Learning speech-recognition
Code Code Available 15 CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command Recognition Jun 1, 2022 Audio-Visual Speech Recognition speech-recognition
Code Code Available 15 AI Accelerator Survey and Trends Sep 18, 2021 Benchmarking Computational Efficiency
Code Code Available 15 CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition May 27, 2019 Decoder Language Modelling
Code Code Available 15 Attention model for articulatory features detection Jul 2, 2019 Manner Of Articulation Detection model
Code Code Available 15 FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer Mar 19, 2024 Representation Learning speech-recognition
Code Code Available 15 Adversarial Attacks against Windows PE Malware Detection: A Survey of the State-of-the-Art Dec 23, 2021 Adversarial Attack Malware Detection
Code Code Available 15 CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Apr 3, 2024 Optical Character Recognition (OCR) speech-recognition
Code Code Available 15 It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition Feb 8, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Jointly Learning Visual and Auditory Speech Representations from Raw Data Dec 12, 2022 Audio-Visual Speech Recognition Lipreading
Code Code Available 15 FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural Networks Jan 18, 2020 Bayesian Optimization Object Detection
Code Code Available 15 AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline Sep 16, 2017 speech-recognition Speech Recognition
Code Code Available 15 A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AISHELL-NER: Named Entity Recognition from Chinese Speech Feb 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Code Code Available 15 FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge Dec 15, 2023 Backdoor Attack Data Poisoning
Code Code Available 15 Compiling ONNX Neural Network Models Using MLIR Aug 19, 2020 speech-recognition Speech Recognition
Code Code Available 15 Computer-Generated Music for Tabletop Role-Playing Games Aug 16, 2020 speech-recognition Speech Recognition
Code Code Available 15 Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Feb 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Kosp2e: Korean Speech to English Translation Corpus Jul 6, 2021 speech-recognition Speech Recognition
Code Code Available 15 Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 FedScale: Benchmarking Model and System Performance of Federated Learning at Scale May 24, 2021 Benchmarking Federated Learning
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Language and Speech Technology for Central Kurdish Varieties Mar 4, 2024 Automatic Speech Recognition Diversity
Code Code Available 15 Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring Sep 19, 2023 Feature Engineering Phone-level pronunciation scoring
Code Code Available 15 Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition Jun 30, 2019 Avg Representation Learning
Code Code Available 15 Attack on practical speaker verification system using universal adversarial perturbations May 19, 2021 Real-World Adversarial Attack Room Impulse Response (RIR)
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CoVoST 2 and Massively Multilingual Speech-to-Text Translation Jul 20, 2020 Machine Translation speech-recognition
Code Code Available 15 Cross Attention Augmented Transducer Networks for Simultaneous Translation Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-Based Models for Speech Recognition Jun 24, 2015 Machine Translation Phoneme Recognition
Code Code Available 15 Radically Old Way of Computing Spectra: Applications in End-to-End ASR Mar 25, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CTC-synchronous Training for Monotonic Attention Model May 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition Oct 21, 2024 cross-modal alignment speech-recognition
Code Code Available 15 DARF: A data-reduced FADE version for simulations of speech recognition thresholds with real hearing aids Jul 10, 2020 Sentence speech-recognition
Code Code Available 15 Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language Feb 7, 2022 image-classification Image Classification
Code Code Available 15 Learning to Detect Noisy Labels Using Model-Based Features Dec 28, 2022 Meta-Learning speech-recognition
Code Code Available 15 A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing Mar 18, 2022 Representation Learning Speaker Verification
Code Code Available 15 FlanEC: Exploring Flan-T5 for Post-ASR Error Correction Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15