A Toolkit for Efficient Learning of Lexical Units for Speech Recognition May 1, 2014 Information Retrieval Language Modeling
When Is TTS Augmentation Through a Pivot Language Useful? Jul 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
NeMo Inverse Text Normalization: From Development To Production Apr 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
I3D: Transformer architectures with input-dependent dynamic depth for speech recognition Mar 14, 2023 Model Compression speech-recognition
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit Oct 24, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DSD: Dense-Sparse-Dense Training for Deep Neural Networks Jul 15, 2016 8k Caption Generation
HydraFormer: One Encoder For All Subsampling Rates Aug 8, 2024 All Automatic Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition Jul 10, 2018 Object Object Recognition
Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks Jul 15, 2024 Action Classification Data Augmentation
Whose Emotion Matters? Speaking Activity Localisation without Prior Knowledge Nov 23, 2022 Active Speaker Detection Automatic Speech Recognition
Neural Architecture Search: A Survey Aug 16, 2018 Machine Translation Neural Architecture Search
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks Jan 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition Nov 16, 2021 Phoneme Recognition Representation Learning
Neural Architecture Search: Insights from 1000 Papers Jan 20, 2023 Natural Language Understanding Neural Architecture Search
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training Apr 16, 2024 Language Modeling Language Modelling
Teaching Wav2Vec2 the Language of the Brain Jan 16, 2025 Brain Decoding speech-recognition
Hybrid phonetic-neural model for correction in speech recognition systems Feb 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Theory of Unsupervised Speech Recognition Jun 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Target-Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning Apr 8, 2019 Face Recognition Image Classification
A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Speech Recognition with Deep Recurrent Neural Networks Mar 22, 2013 Handwriting Recognition Phoneme Recognition
Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication Mar 11, 2025 Lip Reading Prompt Engineering
Self-supervised Speech Representations Still Struggle with African American Vernacular English Aug 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Decoding P300 Variability using Convolutional Neural Networks Jun 14, 2019 EEG Eeg Decoding
A Dataset for Speech Emotion Recognition in Greek Theatrical Plays Mar 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Self-Train Before You Transcribe Jun 17, 2024 Domain Adaptation Language Modelling
Self-training and Pre-training are Complementary for Speech Recognition Oct 22, 2020 speech-recognition Speech Recognition
A Survey of Recent DNN Architectures on the TIMIT Phone Recognition Task Jun 19, 2018 speech-recognition Speech Recognition
Hybrid Macro/Micro Level Backpropagation for Training Deep Spiking Neural Networks May 21, 2018 Image Classification speech-recognition
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism Mar 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Letter-Based Speech Recognition with Gated ConvNets Dec 22, 2017 Decoder Language Modeling
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition Nov 6, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training Dec 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling Feb 5, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Speech Understanding on Tiny Devices with A Learning Cache Nov 30, 2023 speech-recognition Speech Recognition
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
TinyML for Speech Recognition Apr 22, 2025 speech-recognition Speech Recognition
Bidirectional Quaternion Long-Short Term Memory Recurrent Neural Networks for Speech Recognition Nov 6, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Semantically Corrected Amharic Automatic Speech Recognition Apr 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Semantically Meaningful Metrics for Norwegian ASR Systems Sep 3, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR Jan 6, 2024 Active Learning Automatic Speech Recognition
Transcription free filler word detection with Neural semi-CRFs Mar 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Combination of Convolutional and Recurrent Neural Network for Sentiment Analysis of Short Texts Dec 1, 2016 Information Retrieval Sentiment Analysis
Neural network based spectral mask estimation for acoustic beamforming Mar 20, 2016 speech-recognition Speech Recognition
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper Jun 9, 2024 speech-recognition Speech Recognition
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof May 1, 2016 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Semantic Mask for Transformer based End-to-End Speech Recognition Dec 6, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN Jul 24, 2023 Automatic Speech Recognition Sentiment Analysis
Neural NILM: Deep Neural Networks Applied to Energy Disaggregation Jul 23, 2015 Automatic Speech Recognition Automatic Speech Recognition (ASR)