Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Error-preserving Automatic Speech Recognition of Young English Learners' Language Jun 5, 2024 Automatic Speech Recognition Language Modelling
Code Code Available 0Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding Oct 9, 2023 slot-filling Slot Filling
Code Code Available 0Domain Adaptation Using Class Similarity for Robust Speech Recognition Nov 5, 2020 Domain Adaptation Robust Speech Recognition
Code Code Available 0Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning Fusion Sep 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Neural Sentiment Classification with User and Product Attention Nov 1, 2016 Classification Feature Engineering
Code Code Available 0Does Joint Training Really Help Cascaded Speech Translation? Oct 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 03D Convolutional Neural Networks for Cross Audio-Visual Matching Recognition Jun 18, 2017 Speaker Verification speech-recognition
Code Code Available 0PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch Jul 12, 2019 speech-recognition Speech Recognition
Code Code Available 0Using Filter Banks in Convolutional Neural Networks for Texture Classification Jan 12, 2016 Classification General Classification
Code Code Available 0Human Transcription Quality Improvement Sep 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Survey of Deep Active Learning Aug 30, 2020 Active Learning speech-recognition
Code Code Available 0To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using Cochlear Cepstrum-based Masking for Speech Emotion Recognition Feb 10, 2024 Contrastive Learning Emotion Recognition
Code Code Available 0Do Deep Nets Really Need to be Deep? Dec 21, 2013 Phoneme Recognition speech-recognition
Code Code Available 0HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition Apr 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Mar 7, 2024 Audio-Visual Speech Recognition Knowledge Distillation
Code Code Available 0Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation Oct 30, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech-text based multi-modal training with bidirectional attention for improved speech recognition Nov 1, 2022 speech-recognition Speech Recognition
Code Code Available 0lex4all: A language-independent tool for building and evaluating pronunciation lexicons for small-vocabulary speech recognition Jun 1, 2014 speech-recognition Speech Recognition
Code Code Available 0Textless Speech-to-Speech Translation With Limited Parallel Data May 24, 2023 Automatic Speech Recognition Denoising
Code Code Available 0Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation Apr 8, 2019 speech-recognition Speech Recognition
Code Code Available 0The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024 Aug 5, 2024 Decoder speech-recognition
Code Code Available 0QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding Oct 7, 2016 image-classification Image Classification
Code Code Available 0Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks Sep 2, 2018 speech-recognition Speech Recognition
Code Code Available 0How You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia Detection May 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0How Phonotactics Affect Multilingual and Zero-shot ASR Performance Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting Oct 18, 2017 Keyword Spotting speech-recognition
Code Code Available 0Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers Oct 15, 2023 Decoder speech-recognition
Code Code Available 0HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation Jun 20, 2023 Cross-corpus Sentence
Code Code Available 0LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition Oct 17, 2019 Sentence speech-recognition
Code Code Available 0Light Gated Recurrent Units for Speech Recognition Mar 26, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0High-order Graph-based Neural Dependency Parsing Oct 1, 2015 Dependency Parsing Machine Translation
Code Code Available 0Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video Jun 5, 2015 Gesture Recognition Image Captioning
Code Code Available 0Hierarchical Text Generation using an Outline Oct 20, 2018 Dialogue Generation speech-recognition
Code Code Available 0Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition Apr 8, 2022 speech-recognition Speech Recognition
Code Code Available 0Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications Aug 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Enhancing Quantised End-to-End ASR Models via Personalisation Sep 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0NIESR: Nuisance Invariant End-to-end Speech Recognition Jul 7, 2019 speech-recognition Speech Recognition
Code Code Available 0Bayesian Learning for Deep Neural Network Adaptation Dec 14, 2020 speech-recognition Speech Recognition
Code Code Available 0Quantifying Bias in Automatic Speech Recognition Mar 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network Jun 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube Apr 29, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Lightweight Transducer Based on Frame-Level Criterion Sep 5, 2024 Decoder imbalanced classification
Code Code Available 0Text-Based Detection of On-Hold Scripts in Contact Center Calls Jul 13, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0Linear Time Complexity Conformers with SummaryMixing for Streaming Speech Recognition Sep 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling Sep 25, 2024 Automatic Speech Recognition Emotion Recognition
Code Code Available 0Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation Jun 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Quantization and Deployment of Deep Neural Networks on Microcontrollers May 27, 2021 Activity Recognition Human Activity Recognition
Code Code Available 0Quantization for OpenAI's Whisper Models: A Comparative Analysis Mar 12, 2025 Quantization speech-recognition
Code Code Available 0