Speech Translation Refinement using Large Language Models Jan 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Tools and resources for Romanian text-to-speech and speech-to-text applications Feb 15, 2018 speech-recognition Speech Recognition
Code Code Available 0SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training Oct 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition Jun 20, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0No More Mumbles: Enhancing Robot Intelligibility through Speech Adaptation May 15, 2024 speech-recognition Speech Recognition
Code Code Available 0End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator Oct 29, 2022 intent-classification Intent Classification
Code Code Available 0Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition Jan 29, 2019 speech-recognition Speech Recognition
Code Code Available 0A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition Jul 27, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Quaternion Recurrent Neural Networks Jun 12, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive Envelopes Aug 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Oct 5, 2023 Decoder Logical Reasoning
Code Code Available 0LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild Nov 21, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 0Harnessing Evolution of Multi-Turn Conversations for Effective Answer Retrieval Dec 22, 2019 Retrieval speech-recognition
Code Code Available 0Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing Oct 22, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Hardware Synthesis of State-Space Equations; Application to FPGA Implementation of Shallow and Deep Neural Networks May 15, 2021 speech-recognition Speech Recognition
Code Code Available 0Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation Apr 7, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR May 29, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech Wikimedia: A 77 Language Multilingual Speech Dataset Aug 30, 2023 Machine Translation speech-recognition
Code Code Available 0Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition Jan 3, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 0Distributed Learning of Deep Neural Networks using Independent Subnet Training Oct 4, 2019 BIG-bench Machine Learning Image Classification
Code Code Available 0Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data Sep 25, 2019 speech-recognition Speech Recognition
Code Code Available 0Improving Non-Intrusive Load Disaggregation through an Attention-Based Deep Neural Network Nov 15, 2019 Decoder Denoising
Code Code Available 0A Comparative Study on Transformer vs RNN in Speech Applications Sep 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks Aug 31, 2018 Speech Enhancement Speech Recognition
Code Code Available 0Textless Dependency Parsing by Labeled Sequence Prediction Jul 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0WiSeBE: Window-based Sentence Boundary Evaluation Aug 27, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0RadioTalk: a large-scale corpus of talk radio transcripts Jul 16, 2019 Descriptive speech-recognition
Code Code Available 0Random Directional Attack for Fooling Deep Neural Networks Aug 6, 2019 speech-recognition Speech Recognition
Code Code Available 0Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information Jul 21, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 0Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments Jul 1, 2017 speech-recognition Speech Recognition
Code Code Available 0Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation Dec 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition Aug 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition Apr 5, 2021 speech-recognition Speech Recognition
Code Code Available 0Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator May 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model Mar 12, 2019 Data Augmentation speech-recognition
Code Code Available 0LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition Dec 5, 2022 Position speech-recognition
Code Code Available 0Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts Jun 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM May 24, 2023 Language Modelling Question Answering
Code Code Available 0Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition Jul 9, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Data Quality Measures and Efficient Evaluation Algorithms for Large-Scale High-Dimensional Data Jan 5, 2021 BIG-bench Machine Learning speech-recognition
Code Code Available 0BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization Nov 16, 2024 Machine Translation speech-recognition
Code Code Available 0Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning Sep 16, 2022 Natural Language Understanding reinforcement-learning
Code Code Available 0RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains Jan 28, 2025 Decision Making speech-recognition
Code Code Available 0CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR May 24, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises Feb 14, 2023 Data Augmentation Fairness
Code Code Available 0Seq2seq for Automatic Paraphasia Detection in Aphasic Speech Dec 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Geometric deep learning on graphs and manifolds using mixture model CNNs Nov 25, 2016 Deep Learning Document Classification
Code Code Available 0