EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition Apr 13, 2021 Language Modeling Language Modelling
Code Code Available 0Comparison and Analysis of New Curriculum Criteria for End-to-End ASR Aug 10, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification Aug 22, 2023 Self-Supervised Learning Speaker Identification
Code Code Available 0Improved Speech Enhancement with the Wave-U-Net Nov 27, 2018 Audio Source Separation Speech Enhancement
Code Code Available 0A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition Nov 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing Jan 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models Oct 30, 2022 Emotion Classification Self-Supervised Learning
Code Code Available 0Segmentation-Free Streaming Machine Translation Sep 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Training dynamic models using early exits for automatic speech recognition on resource-constrained devices Sep 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Training Efficient CNNS: Tweaking the Nuts and Bolts of Neural Networks for Lighter, Faster and Robust Models May 23, 2022 Data Augmentation Information Retrieval
Code Code Available 0Selective Attention Merging for low resource tasks: A case study of Child ASR Jan 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling Feb 7, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0An Automatic Speech Recognition System for Bengali Language based on Wav2Vec2 and Transfer Learning Sep 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks May 5, 2023 Automatic Speech Recognition Cultural Vocal Bursts Intensity Prediction
Code Code Available 0Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia Jun 10, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0CoMFLP: Correlation Measure based Fast Search on ASR Layer Pruning Sep 21, 2023 speech-recognition Speech Recognition
Code Code Available 0Learning Human Pose Estimation Features with Convolutional Networks Dec 27, 2013 Object Recognition Pose Estimation
Code Code Available 0Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition Jan 22, 2019 Classification Decoder
Code Code Available 0Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition Mar 30, 2022 All Automatic Speech Recognition
Code Code Available 0Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings Sep 10, 2024 Automatic Speech Recognition Diversity
Code Code Available 0Probing Acoustic Representations for Phonetic Properties Oct 25, 2020 Benchmarking speech-recognition
Code Code Available 0Acoustic absement in detail: Quantifying acoustic differences across time-series representations of speech data Apr 12, 2023 Dynamic Time Warping speech-recognition
Code Code Available 0Targeted Adversarial Examples for Black Box Audio Systems May 20, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0ImportantAug: a data augmentation agent for speech Dec 14, 2021 Data Augmentation Keyword Spotting
Code Code Available 0Combining Residual Networks with LSTMs for Lipreading Mar 12, 2017 Lipreading Lip Reading
Code Code Available 0Self-Powered LLM Modality Expansion for Large Speech-Text Models Oct 4, 2024 Automatic Speech Recognition Instruction Following
Code Code Available 0Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Jan 2, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems May 9, 2024 Audio-Visual Speech Recognition Lipreading
Code Code Available 0Speech Recognition Challenge in the Wild: Arabic MGB-3 Sep 21, 2017 Arabic Speech Recognition Dialect Identification
Code Code Available 0Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms Jun 12, 2025 Automatic Speech Recognition Keyword Spotting
Code Code Available 0ProGRes: Prompted Generative Rescoring on ASR n-Best Aug 30, 2024 speech-recognition Speech Recognition
Code Code Available 0Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition Mar 18, 2019 Decoder Handwritten Text Recognition
Code Code Available 0Learning Optimal Data Augmentation Policies via Bayesian Optimization for Image Classification Tasks May 6, 2019 Bayesian Optimization Data Augmentation
Code Code Available 0DeepEMO: Deep Learning for Speech Emotion Recognition Sep 9, 2021 Deep Learning Emotion Recognition
Code Code Available 0Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM Jun 8, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset Nov 1, 2021 Event Detection Retrieval
Code Code Available 0DeepCover: Advancing RNN Test Coverage and Online Error Prediction using State Machine Extraction Feb 10, 2024 Decision Making speech-recognition
Code Code Available 0Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition Oct 24, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Evaluating Gammatone Frequency Cepstral Coefficients with Neural Networks for Emotion Recognition from Speech Jun 23, 2018 Classification Emotion Recognition
Code Code Available 0Deep convolutional acoustic word embeddings using word-pair side information Oct 5, 2015 speech-recognition Speech Recognition
Code Code Available 0Task Loss Estimation for Sequence Prediction Nov 19, 2015 Decoder Language Modeling
Code Code Available 0Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition Jan 4, 2024 Attribute Automatic Speech Recognition
Code Code Available 0Train Like a (Var)Pro: Efficient Training of Neural Networks with Variable Projection Jul 26, 2020 image-classification Image Classification
Code Code Available 0Evaluating context-invariance in unsupervised speech representations Oct 27, 2022 Language Modelling speech-recognition
Code Code Available 0Learning to adapt: a meta-learning approach for speaker adaptation Aug 30, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition Mar 22, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Learning to detect dysarthria from raw speech Nov 27, 2018 General Classification Sentence
Code Code Available 0Dysarthria Normalization via Local Lie Group Transformations for Robust ASR Apr 16, 2025 Robust Speech Recognition speech-recognition
Code Code Available 0Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models Jul 16, 2024 Attribute Speaker Identification
Code Code Available 0Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition Jul 30, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0