emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography Oct 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Fast Transformers with Clustered Attention Jul 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 CMGAN: Conformer-based Metric GAN for Speech Enhancement Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels Mar 25, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 25 Large Language Models are Efficient Learners of Noise-Robust Speech Recognition Jan 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Dompteur: Taming Audio Adversarial Examples Feb 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AV Taris: Online Audio-Visual Speech Recognition Dec 14, 2020 Action Detection Activity Detection
Code Code Available 15 Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation Nov 2, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 15 AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Jun 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Back Translation for Speech-to-text Translation Without Transcripts May 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling the Knowledge of BERT for Sequence-to-Sequence ASR Aug 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Nov 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup Nov 2, 2022 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 15 Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Oct 26, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CTC-synchronous Training for Monotonic Attention Model May 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ArTST: Arabic Text and Speech Transformer Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs Jun 26, 2024 ArzEn Code-switched Translation to ara ArzEn Code-switched Translation to eng
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15