Enhancing Monotonic Multihead Attention for Streaming ASR May 19, 2020 All Automatic Speech Recognition
Code Code Available 15 ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition Oct 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech Oct 1, 2023 speech-recognition Speech Recognition
Code Code Available 15 End-to-End Speech Recognition and Disfluency Removal Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 End-to-End Speech Recognition from Federated Acoustic Models Apr 29, 2021 2k 4k
Code Code Available 15 Attention model for articulatory features detection Jul 2, 2019 Manner Of Articulation Detection model
Code Code Available 15 Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention Oct 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 AutoDiCE: Fully Automated Distributed CNN Inference at the Edge Jul 20, 2022 Code Generation image-classification
Code Code Available 15 Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation Nov 1, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 15 ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi Apr 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation Jul 26, 2024 Contrastive Learning speech-recognition
Code Code Available 15 Evaluating the visualization of what a Deep Neural Network has learned Sep 21, 2015 Classification General Classification
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention Jul 13, 2020 Automatic Lyrics Transcription speech-recognition
Code Code Available 15 Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning Mar 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Accented Speech Recognition With Accent-specific Codebooks Oct 24, 2023 Accented Speech Recognition Automatic Speech Recognition
Code Code Available 15 Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition Sep 21, 2023 speech-recognition Speech Recognition
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation Nov 17, 2024 Action Recognition backdoor defense
Code Code Available 15 EnCodecMAE: Leveraging neural codecs for universal audio representation learning Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attack on practical speaker verification system using universal adversarial perturbations May 19, 2021 Real-World Adversarial Attack Room Impulse Response (RIR)
Code Code Available 15 FlanEC: Exploring Flan-T5 for Post-ASR Error Correction Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Fully Differentiable Beam Search Decoder Feb 16, 2019 Decoder Language Modeling
Code Code Available 15 A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition May 20, 2020 speech-recognition Speech Recognition
Code Code Available 15 BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications Apr 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning Simulations Mar 25, 2022 Federated Learning Quantization
Code Code Available 15 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition Apr 7, 2022 Mixture-of-Experts speech-recognition
Code Code Available 15 Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement Jun 6, 2024 Diversity Speech Enhancement
Code Code Available 15 BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Code Code Available 15 Generalizing in the Real World with Representation Learning Oct 18, 2022 Drug Discovery Representation Learning
Code Code Available 15 Attention-Based Models for Speech Recognition Jun 24, 2015 Machine Translation Phoneme Recognition
Code Code Available 15 End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings Apr 8, 2021 Emotion Recognition Speech Emotion Recognition
Code Code Available 15 BIG-C: a Multimodal Multi-Purpose Dataset for Bemba May 26, 2023 Machine Translation speech-recognition
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 Jul 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Emotion Recognition in Audio and Video Using Deep Neural Networks Jun 15, 2020 Deep Learning Emotion Recognition
Code Code Available 15 A Discriminative Hierarchical PLDA-based Model for Spoken Language Recognition Jan 4, 2022 Machine Translation speech-recognition
Code Code Available 15 Adversarial Attacks against Windows PE Malware Detection: A Survey of the State-of-the-Art Dec 23, 2021 Adversarial Attack Malware Detection
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BrainBERT: Self-supervised representation learning for intracranial recordings Feb 28, 2023 Language Modeling Language Modelling
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15