Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Jun 22, 2025 Automatic Speech Recognition speech-recognition
Realizing Petabyte Scale Acoustic Modeling Apr 24, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
SpokeN-100: A Cross-Lingual Benchmarking Dataset for The Classification of Spoken Numbers in Different Languages Mar 14, 2024 Benchmarking Dimensionality Reduction
Long short-term memory and learning-to-learn in networks of spiking neurons Mar 26, 2018 Reinforcement Learning Sequential Image Classification
Sequence Labeling Approach to the Task of Sentence Boundary Detection Jan 20, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition Feb 5, 2014 Handwriting Recognition Language Modeling
End-to-End Speech Recognition From the Raw Waveform Jun 19, 2018 speech-recognition Speech Recognition
Spoken English Intelligibility Remediation with PocketSphinx Alignment and Feature Extraction Improves Substantially over the State of the Art Sep 6, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Long-term Conversation Analysis: Exploring Utility and Privacy Jun 28, 2023 Action Detection Activity Detection
Real-time low-resource phoneme recognition on edge devices Mar 25, 2021 Phoneme Recognition speech-recognition
Discrete Speech Unit Extraction via Independent Component Analysis Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
TF-LM: TensorFlow-based Language Modeling Toolkit May 1, 2018 Language Modeling Language Modelling
Thai Wav2Vec2.0 with CommonVoice V8 Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Sequence Modeling via Segmentations Feb 24, 2017 Segmentation speech-recognition
Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Spoken Language Intent Detection using Confusion2Vec Apr 7, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Transfer Learning for Speech Recognition on a Budget Jun 1, 2017 GPU speech-recognition
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo Mar 13, 2021 Machine Translation speech-recognition
Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation Oct 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
OLISIA: a Cascade System for Spoken Dialogue State Tracking Apr 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining Sep 8, 2023 Language Modeling Language Modelling
Low Frequency Adversarial Perturbation Sep 24, 2018 Denoising Speech Recognition
Chemception: A Deep Neural Network with Minimal Chemistry Knowledge Matches the Performance of Expert-developed QSAR/QSPR Models Jun 20, 2017 Computational chemistry Deep Learning
Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection May 22, 2020 Decoder Sequence-To-Sequence Speech Recognition
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations Aug 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A low latency attention module for streaming self-supervised speech representation learning Feb 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation Nov 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition Jan 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
THCHS-30 : A Free Chinese Speech Corpus Dec 7, 2015 speech-recognition Speech Recognition
Sequence-to-Sequence Modeling for Action Identification at High Temporal Resolution Nov 3, 2021 Action Recognition speech-recognition
Sequence-to-Sequence Models Can Directly Translate Foreign Speech Mar 24, 2017 Decoder Machine Translation
ChatGPT in the context of precision agriculture data analytics Nov 10, 2023 Language Modelling speech-recognition
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR Dec 7, 2024 Automatic Speech Recognition Data Augmentation
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition Sep 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors Nov 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
On-Device Neural Language Model Based Word Prediction Aug 1, 2018 Automatic Speech Recognition (ASR) Language Modeling
Sequence Transduction with Recurrent Neural Networks Nov 14, 2012 Machine Translation Phoneme Recognition
Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language May 20, 2025 Multi-Task Learning Sign Language Recognition
SSR7000: A Synchronized Corpus of Ultrasound Tongue Imaging for End-to-End Silent Speech Recognition Jun 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
LRS3-TED: a large-scale dataset for visual speech recognition Sep 3, 2018 Audio-Visual Speech Recognition speech-recognition
Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition Nov 6, 2018 Generative Adversarial Network Speech Enhancement
LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild Oct 16, 2018 Lipreading Lip Reading
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project Apr 5, 2016 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Sequential Randomized Smoothing for Adversarially Robust Speech Recognition Nov 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Sequential Routing Framework: Fully Capsule Network-based Speech Recognition Jul 23, 2020 speech-recognition Speech Recognition
LSTM: A Search Space Odyssey Mar 13, 2015 CPU Handwriting Recognition
LSTM Benchmarks for Deep Learning Frameworks Jun 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain Feb 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition Dec 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)