Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator May 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation Dec 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition Apr 8, 2022 speech-recognition Speech Recognition
Code Code Available 05 A Unified Speaker Adaptation Approach for ASR Oct 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition Apr 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM May 24, 2023 Language Modelling Question Answering
Code Code Available 05 Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding Feb 10, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Geometric deep learning on graphs and manifolds using mixture model CNNs Nov 25, 2016 Deep Learning Document Classification
Code Code Available 05 Generating gender-ambiguous voices for privacy-preserving speech recognition Jul 3, 2022 Attribute Generative Adversarial Network
Code Code Available 05 Long-term Conversation Analysis: Exploring Utility and Privacy Jun 28, 2023 Action Detection Activity Detection
Code Code Available 05 Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition Aug 17, 2024 Language Modeling Language Modelling
Code Code Available 05 Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech Oct 30, 2018 Speech Recognition Voice Conversion
Code Code Available 05 Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment Jul 6, 2023 Speaker Identification speech-recognition
Code Code Available 05 LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring Apr 6, 2021 ARC Automatic Speech Recognition
Code Code Available 05 A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Mar 7, 2024 Audio-Visual Speech Recognition Knowledge Distillation
Code Code Available 05 Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 AI-Generated Song Detection via Lyrics Transcripts Jun 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 From Gameplay to Symbolic Reasoning: Learning SAT Solver Heuristics in the Style of Alpha(Go) Zero Feb 14, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 05 A Simplified Fully Quantized Transformer for End-to-end Speech Recognition Nov 9, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Game of Gradients: Mitigating Irrelevant Clients in Federated Learning Oct 23, 2021 Federated Learning image-classification
Code Code Available 05 FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech May 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding Jan 10, 2025 Automatic Speech Recognition Classification
Code Code Available 05 FlowSense: Monitoring Airflow in Building Ventilation Systems Using Audio Sensing Feb 22, 2022 Privacy Preserving speech-recognition
Code Code Available 05 First Automatic Fongbe Continuous Speech Recognition System: Development of Acoustic Models and Language Models Jan 21, 2017 Language Modeling Language Modelling
Code Code Available 05 Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts Jun 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 AI4D -- African Language Program Apr 6, 2021 Machine Translation speech-recognition
Code Code Available 05 Finnish Parliament ASR corpus - Analysis, benchmarks and statistics Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs Aug 12, 2014 Language Modeling Language Modelling
Code Code Available 05 Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients May 27, 2024 Automatic Speech Recognition Federated Learning
Code Code Available 05 Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq May 25, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Automatic Dialect Detection in Arabic Broadcast Speech Sep 23, 2015 Dialect Identification Language Identification
Code Code Available 05 Fine-Grained Grounding for Multimodal Speech Recognition Oct 5, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Federated Learning in ASR: Not as Easy as You Think Sep 30, 2021 Federated Learning speech-recognition
Code Code Available 05 Fast-Slow Recurrent Neural Networks May 24, 2017 Language Modeling Language Modelling
Code Code Available 05 On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition Feb 5, 2019 Decoder Language Modeling
Code Code Available 05 FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization Oct 21, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion Jul 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study Mar 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations Nov 14, 2022 Self-Supervised Learning speech-recognition
Code Code Available 05 Extended Bit-Plane Compression for Convolutional Neural Network Accelerators Oct 1, 2018 image-classification Image Classification
Code Code Available 05 Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020) May 11, 2020 Clustering speech-recognition
Code Code Available 05 Leveraging Cross-Lingual Transfer Learning in Spoken Named Entity Recognition Systems Jul 3, 2023 Cross-Lingual Transfer named-entity-recognition
Code Code Available 05 Analysis of French Phonetic Idiosyncrasies for Accent Recognition Oct 18, 2021 Multi-class Classification speech-recognition
Code Code Available 05 Exploring Generative Error Correction for Dysarthric Speech Recognition May 26, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks Apr 22, 2024 speech-recognition Speech Recognition
Code Code Available 05 Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization Feb 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Exploiting Adapters for Cross-lingual Low-resource Speech Recognition May 18, 2021 Cross-Lingual ASR General Knowledge
Code Code Available 05 Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners Apr 8, 2022 Prediction Speech Enhancement
Code Code Available 05 Exploring spectro-temporal features in end-to-end convolutional neural networks Jan 1, 2019 speech-recognition Speech Recognition
Code Code Available 05