Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks May 5, 2023 Automatic Speech Recognition Cultural Vocal Bursts Intensity Prediction
A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors Nov 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit Oct 24, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation Apr 8, 2019 speech-recognition Speech Recognition
A Probabilistic Theory of Deep Learning Apr 2, 2015 Deep Learning Object
Error-preserving Automatic Speech Recognition of Young English Learners' Language Jun 5, 2024 Automatic Speech Recognition Language Modelling
Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions Feb 1, 2025 Lipreading speech-recognition
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network Jun 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive Envelopes Aug 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Adversarial Example Detection by Classification for Deep Speech Recognition Oct 22, 2019 Classification General Classification
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator Oct 29, 2022 intent-classification Intent Classification
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
End-to-End Speech Recognition From the Raw Waveform Jun 19, 2018 speech-recognition Speech Recognition
End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining Sep 8, 2023 Language Modeling Language Modelling
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model Mar 12, 2019 Data Augmentation speech-recognition
Enhancing Quantised End-to-End ASR Models via Personalisation Sep 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks Jun 12, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
End-to-End Attention-based Large Vocabulary Speech Recognition Aug 18, 2015 Acoustic Modelling Language Modeling
End-to-end Audiovisual Speech Recognition Feb 18, 2018 Lipreading speech-recognition
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures Nov 19, 2019 Language Modeling Language Modelling
End to End ASR System with Automatic Punctuation Insertion Dec 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Emotional Speech Recognition with Pre-trained Deep Visual Models Apr 6, 2022 Emotion Recognition speech-recognition
End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Efficient Keyword Spotting by capturing long-range interactions with Temporal Lambda Networks Apr 16, 2021 Keyword Spotting speech-recognition
ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Mar 29, 2024 Automatic Speech Recognition speech-recognition
Application of Word2vec in Phoneme Recognition Dec 17, 2019 Phoneme Recognition speech-recognition
Efficient and Generic 1D Dilated Convolution Layer for Deep Learning Apr 16, 2021 CPU Deep Learning
Efficient Ensemble for Multimodal Punctuation Restoration using Time-Delay Neural Network Feb 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings Sep 10, 2024 Automatic Speech Recognition Diversity
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks Dec 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ED-CEC: Improving Rare Word Recognition Using ASR Postprocessing Based on Error Detection and Context-Aware Error Correction Oct 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators Aug 30, 2019 image-classification Image Classification
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding Jul 29, 2015 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks Aug 28, 2023 Speech Recognition
Dysarthria Normalization via Local Lie Group Transformations for Robust ASR Apr 16, 2025 Robust Speech Recognition speech-recognition
EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition Apr 13, 2021 Language Modeling Language Modelling
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages Feb 8, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Jan 2, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DSD: Dense-Sparse-Dense Training for Deep Neural Networks Jul 15, 2016 8k Caption Generation
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper Jun 9, 2024 speech-recognition Speech Recognition
Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks Jul 15, 2024 Action Classification Data Augmentation
Efficient Adaptation of Multilingual Models for Japanese ASR Dec 14, 2024 Automatic Speech Recognition speech-recognition
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations Aug 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization Oct 21, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Distributed Learning of Deep Neural Networks using Independent Subnet Training Oct 4, 2019 BIG-bench Machine Learning Image Classification
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation Apr 7, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data Sep 25, 2019 speech-recognition Speech Recognition
An Overview of Multi-Task Learning in Deep Neural Networks Jun 15, 2017 BIG-bench Machine Learning Drug Discovery
Do Deep Nets Really Need to be Deep? Dec 21, 2013 Phoneme Recognition speech-recognition