Automatic Speech Recognition for Speech Assessment of Persian Preschool Children | Mar 24, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Automatic Speech Recognition Benchmark for Air-Traffic Communications | Jun 18, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights | Jun 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | May 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Adaptation of Whisper models to child speech recognition | Jul 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish | Jul 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
AV Taris: Online Audio-Visual Speech Recognition | Dec 14, 2020 | Action Detection, Activity Detection | Code Available 15
Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients | Nov 11, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Back Translation for Speech-to-text Translation Without Transcripts | May 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Adapting End-to-End Speech Recognition for Readable Subtitles | May 25, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks | Oct 7, 2016 | Anomaly Detection, Automatic Speech Recognition | Code Available 15
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Aug 9, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model | Jun 2, 2023 | speech-recognition, Speech Recognition | Code Available 15
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement | Jun 6, 2024 | Diversity, Speech Enhancement | Code Available 15
Emotion Recognition in Audio and Video Using Deep Neural Networks | Jun 15, 2020 | Deep Learning, Emotion Recognition | Code Available 15
BembaSpeech: A Speech Recognition Corpus for the Bemba Language | Feb 9, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
EnCodecMAE: Leveraging neural codecs for universal audio representation learning | Sep 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Adapting Pretrained Transformer to Lattices for Spoken Language Understanding | Nov 2, 2020 | Natural Language Understanding, speech-recognition | Code Available 15
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data | Jan 28, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications | Oct 12, 2021 | Action Detection, Activity Detection | Code Available 15
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation | Nov 1, 2023 | Automatic Speech Recognition, speech-recognition | Code Available 15
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba | May 26, 2023 | Machine Translation, speech-recognition | Code Available 15
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition | Jun 1, 2022 | Audio-Visual Speech Recognition, speech-recognition | Code Available 15
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech | Oct 1, 2023 | speech-recognition, Speech Recognition | Code Available 15
Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi | Apr 3, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition | Sep 5, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Factorized Neural Transducer for Efficient Language Model Adaptation | Sep 27, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Attention-Based Models for Speech Recognition | Jun 24, 2015 | Machine Translation, Phoneme Recognition | Code Available 15
Attack on practical speaker verification system using universal adversarial perturbations | May 19, 2021 | Real-World Adversarial Attack, Room Impulse Response (RIR) | Code Available 15
A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline | Sep 22, 2020 | speech-recognition, Speech Recognition | Code Available 15
BrainBERT: Self-supervised representation learning for intracranial recordings | Feb 28, 2023 | Language Modeling, Language Modelling | Code Available 15
Distilling a Pretrained Language Model to a Multilingual ASR Model | Jun 25, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Bridging the Gap between Spatial and Spectral Domains: A Unified Framework for Graph Neural Networks | Jul 21, 2021 | Image Classification, Natural Language Understanding | Code Available 15
A Comparison of Methods for OOV-word Recognition on a New Public Dataset | Jul 16, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition | Sep 21, 2023 | speech-recognition, Speech Recognition | Code Available 15
dMel: Speech Tokenization made Simple | Jul 22, 2024 | Decoder, Language Modeling | Code Available 15
Byakto Speech: Real-time long speech synthesis with convolutional neural network: Transfer learning from English to Bangla | May 31, 2021 | Deep Learning, speech-recognition | Code Available 15
AdaScale SGD: A User-Friendly Algorithm for Distributed Training | Jul 9, 2020 | image-classification, Image Classification | Code Available 15
Calibrating Transformers via Sparse Gaussian Processes | Mar 4, 2023 | Bayesian Inference, Gaussian Processes | Code Available 15
CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings | Jun 6, 2021 | Machine Translation, speech-recognition | Code Available 15
Can we use Common Voice to train a Multi-Speaker TTS system? | Oct 12, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Mar 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Jan 22, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Differentiable Weighted Finite-State Transducers | Oct 2, 2020 | Handwriting Recognition, speech-recognition | Code Available 15
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications | Nov 8, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement | Jun 22, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
A transfer learning based approach for pronunciation scoring | Nov 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15