AI4D -- African Language Program | Apr 6, 2021 | Machine Translation, speech-recognition
Multi-Stage Speaker Diarization for Noisy Classrooms | May 16, 2025 | Action Detection, Activity Detection
Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation | Sep 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
fairseq S2T: Fast Speech-to-Text Modeling with fairseq | Oct 11, 2020 | Machine Translation, Multi-Task Learning
Extended Bit-Plane Compression for Convolutional Neural Network Accelerators | Oct 1, 2018 | image-classification, Image Classification
Detecting Adversarial Examples for Speech Recognition via Uncertainty Quantification | May 24, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020) | May 11, 2020 | Clustering, speech-recognition
Leveraging Cross-Lingual Transfer Learning in Spoken Named Entity Recognition Systems | Jul 3, 2023 | Cross-Lingual Transfer, named-entity-recognition
A Small and Fast BERT for Chinese Medical Punctuation Restoration | Aug 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Active Learning for Classifying 2D Grid-Based Level Completability | Sep 8, 2023 | Active Learning, speech-recognition
A Transformer with Interleaved Self-attention and Convolution for Hybrid Acoustic Models | Oct 23, 2019 | speech-recognition, Speech Recognition
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech | Jun 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Exploring spectro-temporal features in end-to-end convolutional neural networks | Jan 1, 2019 | speech-recognition, Speech Recognition
Efficient Adaptation of Multilingual Models for Japanese ASR | Dec 14, 2024 | Automatic Speech Recognition, speech-recognition
Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features | Nov 28, 2017 | Action Recognition, Activity Recognition
Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition | Apr 12, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
A Gentle Tutorial of Recurrent Neural Network with Error Backpropagation | Oct 8, 2016 | Handwriting Recognition, Image to text
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks | Apr 22, 2024 | speech-recognition, Speech Recognition
Exploring Generative Error Correction for Dysarthric Speech Recognition | May 26, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Iterative Pseudo-Labeling for Speech Recognition | May 19, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation | Oct 27, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Twin Networks: Matching the Future for Sequence Generation | Aug 22, 2017 | Caption Generation, speech-recognition
Robustness Analysis of Deep Learning Frameworks on Mobile Platforms | Sep 20, 2021 | BIG-bench Machine Learning, Deep Learning
A Speech Representation Anonymization Framework via Selective Noise Perturbation | Mar 26, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Twin Regularization for online speech recognition | Apr 15, 2018 | speech-recognition, Speech Recognition
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition | Oct 23, 2023 | Automatic Speech Recognition, speech-recognition
A Generalized Language Model as the Combination of Skipped n-grams and Modified Kneser Ney Smoothing | Jun 1, 2014 | Language Modeling, Language Modelling
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Keyphrase Cloud Generation of Broadcast News | Jun 19, 2013 | Keyphrase Extraction, speech-recognition
Submodular Rank Aggregation on Score-based Permutations for Distributed Automatic Speech Recognition | Jan 27, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment | Dec 1, 2024 | Action Detection, Activity Detection
VenoMave: Targeted Poisoning Against Speech Recognition | Oct 21, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | May 29, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Investigating Weight-Perturbed Deep Neural Networks With Application in Iris Presentation Attack Detection | Nov 21, 2023 | image-classification, Image Classification
Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners | Apr 8, 2022 | Prediction, Speech Enhancement
Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Investigating the Emergent Audio Classification Ability of ASR Foundation Models | Nov 15, 2023 | Audio Classification, Decoder
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization | Feb 28, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Speaker-adaptive Lip Reading with User-dependent Padding | Aug 9, 2022 | Lip Reading, speech-recognition
Automatic Speech Recognition and Query By Example for Creole Languages Documentation | May 1, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Boosting Cross-Domain Speech Recognition with Self-Supervision | Jun 20, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Exploiting Adapters for Cross-lingual Low-resource Speech Recognition | May 18, 2021 | Cross-Lingual ASR, General Knowledge
Muddling Label Regularization: Deep Learning for Tabular Datasets | Jun 8, 2021 | Deep Learning, Memorization
Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder | Oct 6, 2023 | Alzheimer's Disease Detection, speech-recognition
Explaining Spectrograms in Machine Learning: A Study on Neural Networks for Speech Classification | Jul 10, 2024 | Classification, speech-recognition
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition
ASL Trigger Recognition in Mixed Activity/Signing Sequences for RF Sensor-Based User Interfaces | Nov 10, 2021 | Sign Language Recognition, speech-recognition
Two-Pass End-to-End Speech Recognition | Aug 29, 2019 | speech-recognition, Speech Recognition
The OCON model: an old but gold solution for distributable supervised classification | Oct 5, 2024 | Automatic Speech Recognition, Classification
DELTA: A DEep learning based Language Technology plAtform | Aug 2, 2019 | Abstractive Text Summarization, Deep Learning