Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech | Oct 1, 2023 | speech-recognition, Speech Recognition
Code | Code Available | 1 | Approaching Deep Learning through the Spectral Dynamics of Weights | Aug 21, 2024 | Deep Learning, image-classification
Code | Code Available | 1 | ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi | Apr 3, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | A transfer learning based approach for pronunciation scoring | Nov 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | Factorized Neural Transducer for Efficient Language Model Adaptation | Sep 27, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond | Apr 20, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English | Aug 3, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | Attack on practical speaker verification system using universal adversarial perturbations | May 19, 2021 | Real-World Adversarial Attack, Room Impulse Response (RIR)
Code | Code Available | 1 | Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring | Sep 19, 2023 | Feature Engineering, Phone-level pronunciation scoring
Code | Code Available | 1 | Audio-Visual Efficient Conformer for Robust Speech Recognition | Jan 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer | Mar 19, 2024 | Representation Learning, speech-recognition
Code | Code Available | 1 | FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning Simulations | Mar 25, 2022 | Federated Learning, Quantization
Code | Code Available | 1 | Foundation Transformers | Oct 12, 2022 | Language Modeling, Language Modelling
Code | Code Available | 1 | BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation | Nov 17, 2024 | Action Recognition, backdoor defense
Code | Code Available | 1 | Generative Pre-Training for Speech with Autoregressive Predictive Coding | Oct 23, 2019 | Representation Learning, Speaker Identification
Code | Code Available | 1 | CLSRIL-23: Cross Lingual Speech Representations for Indic Languages | Jul 15, 2021 | Self-Supervised Learning, speech-recognition
Code | Code Available | 1 | Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition | May 19, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | Global Normalization for Streaming Speech Recognition in a Modular Framework | May 26, 2022 | speech-recognition, Speech Recognition
Code | Code Available | 1 | Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview | Oct 14, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | GPU-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition | Oct 22, 2019 | CPU, Decoder
Code | Code Available | 1 | GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition | Nov 8, 2023 | CPU, Decoder
Code | Code Available | 1 | indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages | Mar 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | HiFi-VC: High Quality ASR-Based Voice Conversion | Mar 31, 2022 | speech-recognition, Speech Recognition
Code | Code Available | 1 | How2: A Large-scale Dataset for Multimodal Language Understanding | Nov 1, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset | Oct 9, 2021 | Deep Learning, Emotion Recognition
Code | Code Available | 1 | OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Code | Code Available | 1 | A Neural Morphological Analyzer for Arapaho Verbs Learned from a Finite State Transducer | Aug 1, 2018 | Decoder, Machine Translation
- | Unverified | 0 | A neural document language modeling framework for spoken document retrieval | Oct 31, 2019 | Information Retrieval, Language Modeling
- | Unverified | 0 | A Deep Learning based Wearable Healthcare IoT Device for AI-enabled Hearing Assistance Automation | May 16, 2020 | speech-recognition, Speech Recognition
- | Unverified | 0 | A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data | Jun 1, 2021 | Acoustic echo cancellation, Automatic Speech Recognition
- | Unverified | 0 | A network of deep neural networks for distant speech recognition | Mar 23, 2017 | Distant Speech Recognition, Speech Enhancement
- | Unverified | 0 | A deep-learning based native-language classification by using a latent semantic analysis for the NLI Shared Task 2017 | Sep 1, 2017 | Automatic Speech Recognition (ASR), Dimensionality Reduction
- | Unverified | 0 | A comprehensive analysis on attention models | Oct 22, 2018 | speech-recognition, Speech Recognition
- | Unverified | 0 | An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition | Oct 12, 2022 | Ensemble Learning, Privacy Preserving
- | Unverified | 0 | An enhanced automatic speech recognition system for Arabic | Apr 1, 2017 | Arabic Speech Recognition, Automatic Speech Recognition
- | Unverified | 0 | A Deep Learning Approach for Similar Languages, Varieties and Dialects | Jan 2, 2019 | Deep Learning, Dialect Identification
- | Unverified | 0 | An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network | Aug 6, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
- | Unverified | 0 | An End-to-End Speech Recognition for the Nepali Language | Dec 1, 2021 | Decoder, Language Modeling
- | Unverified | 0 | A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition | Oct 23, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
- | Unverified | 0 | A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition | Apr 5, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
- | Unverified | 0 | A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems | Dec 15, 2014 | Decoder, speech-recognition
- | Unverified | 0 | A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition | Oct 1, 2022 | Phoneme Recognition, speech-recognition
- | Unverified | 0 | An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling | May 25, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
- | Unverified | 0 | Self-Supervised Learning for Multi-Channel Neural Transducer | Aug 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
- | Unverified | 0 | An End-to-end Architecture of Online Multi-channel Speech Separation | Sep 7, 2020 | speech-recognition, Speech Recognition
- | Unverified | 0 | An Empirical Study of Language Model Integration for Transducer based Speech Recognition | Mar 31, 2022 | Language Modeling, Language Modelling
- | Unverified | 0 | A Deep Dive into Deep Cluster | Jul 24, 2022 | Clustering, speech-recognition
- | Unverified | 0 | Assessing the Tolerance of Neural Machine Translation Systems Against Speech Recognition Errors | Apr 24, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
- | Unverified | 0 | An Empirical Study of Efficient ASR Rescoring with Transformers | Oct 24, 2019 | Knowledge Distillation, Language Modeling
- | Unverified | 0 |