Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Open Source Automatic Speech Recognition for German Jul 26, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Zero-shot keyword spotting for visual speech recognition in-the-wild Jul 23, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Word Error Rate Estimation for Speech Recognition: e-WER Jul 1, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces May 25, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition May 14, 2018 Decoder speech-recognition
Code Code Available 1Improved training of end-to-end attention models for speech recognition May 8, 2018 Language Modeling Language Modelling
Code Code Available 1Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition Apr 9, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension Apr 1, 2018 Question Answering Reading Comprehension
Code Code Available 1XNMT: The eXtensible Neural Machine Translation Toolkit Mar 1, 2018 Machine Translation NMT
Code Code Available 1Monotonic Chunkwise Attention Jan 1, 2018 Document Summarization speech-recognition
Code Code Available 1Monotonic Chunkwise Attention Dec 14, 2017 Document Summarization speech-recognition
Code Code Available 1Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models Dec 5, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1State-of-the-art Speech Recognition With Sequence-to-Sequence Models Dec 5, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments Oct 6, 2017 Articles Distant Speech Recognition
Code Code Available 1AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline Sep 16, 2017 speech-recognition Speech Recognition
Code Code Available 1A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 1Wav2Letter: an End-to-End ConvNet-based Speech Recognition System Sep 11, 2016 Speech Recognition
Code Code Available 1Single-Channel Multi-Speaker Separation using Deep Clustering Jul 7, 2016 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Communication-Efficient Learning of Deep Networks from Decentralized Data Feb 17, 2016 Federated Learning Speech Recognition
Code Code Available 1Deep Speech 2: End-to-End Speech Recognition in English and Mandarin Dec 8, 2015 Accented Speech Recognition Noisy Speech Recognition
Code Code Available 1Evaluating the visualization of what a Deep Neural Network has learned Sep 21, 2015 Classification General Classification
Code Code Available 1Listen, Attend and Spell Aug 5, 2015 Decoder Language Modeling
Code Code Available 1Attention-Based Models for Speech Recognition Jun 24, 2015 Machine Translation Phoneme Recognition
Code Code Available 1Deep Speech: Scaling up end-to-end speech recognition Dec 17, 2014 Accented Speech Recognition Speech Recognition
Code Code Available 1An exact mapping between the Variational Renormalization Group and Deep Learning Oct 14, 2014 Deep Learning speech-recognition
Code Code Available 1NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech Jul 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine Jul 17, 2025 Audio Classification Automatic Speech Recognition
— Unverified 0WhisperKit: On-device Real-time ASR with Billion-Scale Transformers Jul 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis Jul 8, 2025 Automatic Speech Recognition Lip Reading
— Unverified 0A Hybrid Machine Learning Framework for Optimizing Crop Selection via Agronomic and Economic Forecasting Jul 6, 2025 Hybrid Machine Learning speech-recognition
— Unverified 0First Steps Towards Voice Anonymization for Code-Switching Speech Jul 2, 2025 speech-recognition Speech Recognition
— Unverified 0Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR Jun 25, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AUTOMATIC PRONUNCIATION MISTAKE DETECTOR PROJECT REPORT Jun 25, 2025 Mistake Detection speech-recognition
— Unverified 0Multimodal Representation Learning and Fusion Jun 25, 2025 AutoML Representation Learning
— Unverified 0VOICE CONTROL ROBOT USING ARDUINO MANAGEMENT SYSTEM PROJECT. Jun 25, 2025 Management speech-recognition
— Unverified 0AI-Generated Song Detection via Lyrics Transcripts Jun 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-End Spoken Grammatical Error Correction Jun 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Jun 22, 2025 Automatic Speech Recognition speech-recognition
Code Code Available 0OpusLM: A Family of Open Unified Speech Language Models Jun 21, 2025 Decoder speech-recognition
— Unverified 0State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition Jun 20, 2025 Automatic Speech Recognition Diversity
— Unverified 0Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages Jun 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization Jun 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automatic Speech Recognition Biases in Newcastle English: an Error Analysis Jun 19, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Weight Factorization and Centralization for Continual Learning in Speech Recognition Jun 19, 2025 Continual Learning speech-recognition
— Unverified 0Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios Jun 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition Jun 17, 2025 Data Augmentation Language Modeling
— Unverified 0Unifying Streaming and Non-streaming Zipformer-based ASR Jun 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR Jun 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0