Streaming Joint Speech Recognition and Disfluency Detection Nov 16, 2022 Decoder Language Modelling
Code Code Available 0CAT: CRF-based ASR Toolkit Nov 20, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures Nov 19, 2019 Language Modeling Language Modelling
Code Code Available 0Adaptive Cascading Network for Continual Test-Time Adaptation Jul 17, 2024 image-classification Image Classification
Code Code Available 0Open Source German Distant Speech Recognition: Corpus and Acoustic Model Dec 11, 2015 Distant Speech Recognition speech-recognition
Code Code Available 0Transforming faces into video stories -- VideoFace2.0 May 4, 2025 Face Detection Face Recognition
Code Code Available 0SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation Feb 27, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Emotional Speech Recognition with Pre-trained Deep Visual Models Apr 6, 2022 Emotion Recognition speech-recognition
Code Code Available 0A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos Jul 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences Mar 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Convolutional Neural Network Language Models Nov 1, 2016 Document Classification General Classification
Code Code Available 0Adapting the adapters for code-switching in multilingual ASR Oct 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Optimal Completion Distillation for Sequence Learning Oct 2, 2018 Position speech-recognition
Code Code Available 0Fine-Grained Grounding for Multimodal Speech Recognition Oct 5, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Attention-based Multi-hypothesis Fusion for Speech Summarization Nov 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects Jun 27, 2024 Automatic Speech Recognition Machine Translation
Code Code Available 0A Morphology-aware Network for Morphological Disambiguation Feb 13, 2017 Deep Learning Feature Engineering
Code Code Available 0Rethinking Evaluation in ASR: Are Our Models Robust Enough? Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients May 27, 2024 Automatic Speech Recognition Federated Learning
Code Code Available 0Malware Makeover: Breaking ML-based Static Analysis by Modifying Executable Bytes Dec 19, 2019 Feature Engineering Malware Detection
Code Code Available 0Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets Oct 23, 2017 Benchmarking BIG-bench Machine Learning
Code Code Available 0SlothSpeech: Denial-of-service Attack Against Speech Recognition Models Jun 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Optimized Speculative Sampling for GPU Hardware Accelerators Jun 16, 2024 Automatic Speech Recognition GPU
Code Code Available 0Streaming Sequence Transduction through Dynamic Compression Feb 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Federated Learning in ASR: Not as Easy as You Think Sep 30, 2021 Federated Learning speech-recognition
Code Code Available 0Mispronunciation detection using self-supervised speech representations Jul 30, 2023 Self-Supervised Learning speech-recognition
Code Code Available 0Convolutional Neural Network for Paraphrase Identification May 1, 2015 ARC Binary Classification
Code Code Available 0Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data Sep 22, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation Dec 6, 2019 Form Machine Translation
Code Code Available 0Optimizing Deep Learning Models For Raspberry Pi Apr 25, 2023 CPU Deep Learning
Code Code Available 0Fast-Slow Recurrent Neural Networks May 24, 2017 Language Modeling Language Modelling
Code Code Available 0Cascaded Cross-Modal Transformer for Audio-Textual Classification Jan 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Mixat: A Data Set of Bilingual Emirati-English Speech May 4, 2024 speech-recognition Speech Recognition
Code Code Available 0Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks Jan 10, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq May 25, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Advancing African-Accented Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models Jun 3, 2023 Accented Speech Recognition Active Learning
Code Code Available 0Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin Oct 21, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition Oct 27, 2023 Data Augmentation speech-recognition
Code Code Available 0Attention-Based Models for Text-Dependent Speaker Verification Oct 28, 2017 Image Captioning Machine Translation
Code Code Available 0Optimus: An Efficient Dynamic Resource Scheduler for Deep Learning Clusters Apr 26, 2018 CPU Deep Learning
Code Code Available 0Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview Aug 14, 2020 Data Augmentation Domain Adaptation
Code Code Available 0DiaCorrect: End-to-end error correction for speaker diarization Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Revealing and Protecting Labels in Distributed Training Oct 31, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Orthographic Transliteration for Kabyle Speech Recognition Nov 1, 2021 speech-recognition Speech Recognition
Code Code Available 0ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Mar 29, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0Careless Whisper: Speech-to-Text Hallucination Harms Feb 12, 2024 Hallucination Language Modeling
Code Code Available 0AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR Jan 13, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction Sep 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Calibrated Structured Prediction Dec 1, 2015 Medical Diagnosis Optical Character Recognition
Code Code Available 0