LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring Apr 6, 2021 ARC Automatic Speech Recognition
Code Code Available 0Writer adaptation for offline text recognition: An exploration of neural network-based methods Jul 11, 2023 Automatic Speech Recognition Handwriting Recognition
Code Code Available 0Using Rule-Based Labels for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction Dec 7, 2017 Property Prediction speech-recognition
Code Code Available 0Written Term Detection Improves Spoken Term Detection Jul 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems Sep 13, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Recurrent DNNs and its Ensembles on the TIMIT Phone Recognition Task Jun 19, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech Oct 30, 2018 Speech Recognition Voice Conversion
Code Code Available 0Online and Linear-Time Attention by Enforcing Monotonic Alignments Apr 3, 2017 Machine Translation Sentence
Code Code Available 0Transformer-Based Approaches for Automatic Music Transcription Feb 12, 2021 Language Modelling Music Transcription
Code Code Available 0Vocoder-free End-to-End Voice Conversion with Transformer Network Feb 5, 2020 speech-recognition Speech Recognition
Code Code Available 0Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models Jan 15, 2024 Data Compression image-classification
Code Code Available 0Recurrent Neural Network-Based Semantic Variational Autoencoder for Sequence-to-Sequence Learning Feb 9, 2018 Imputation Language Modeling
Code Code Available 0Data augmentation using prosody and false starts to recognize non-native children's speech Aug 29, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Recurrent Neural Network Regularization Sep 8, 2014 Caption Generation Image Captioning
Code Code Available 0Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language May 1, 2020 Language Modeling speech-recognition
Code Code Available 0End-to-end Audiovisual Speech Recognition Feb 18, 2018 Lipreading speech-recognition
Code Code Available 0Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference Dec 15, 2023 Quantization speech-recognition
Code Code Available 0Star Temporal Classification: Sequence Classification with Partially Labeled Data Jan 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Recurrent Neural Network Transducer for Audio-Visual Speech Recognition Nov 8, 2019 Audio-Visual Speech Recognition Lipreading
Code Code Available 0Towards a Competitive End-to-End Speech Recognition for CHiME-6 Dinner Party Transcription Apr 22, 2020 Data Augmentation Speech Enhancement
Code Code Available 0On Monotonic Aggregation for Open-domain QA Aug 8, 2023 Language Modeling Language Modelling
Code Code Available 0Generating gender-ambiguous voices for privacy-preserving speech recognition Jul 3, 2022 Attribute Generative Adversarial Network
Code Code Available 0Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity Nov 2, 2021 Cross-Lingual Transfer speech-recognition
Code Code Available 0On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0ADIMA: Abuse Detection In Multilingual Audio Feb 16, 2022 Abuse Detection Automatic Speech Recognition
Code Code Available 0Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition Aug 17, 2024 Language Modeling Language Modelling
Code Code Available 0Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment Jul 6, 2023 Speaker Identification speech-recognition
Code Code Available 0Game of Gradients: Mitigating Irrelevant Clients in Federated Learning Oct 23, 2021 Federated Learning image-classification
Code Code Available 0A Simplified Fully Quantized Transformer for End-to-end Speech Recognition Nov 9, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0RED-ACE: Robust Error Detection for ASR using Confidence Embeddings Mar 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions Oct 1, 2019 speech-recognition Speech Recognition
Code Code Available 0Character-Level Incremental Speech Recognition with Recurrent Neural Networks Jan 25, 2016 Language Modeling Language Modelling
Code Code Available 0Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention Oct 23, 2020 speech-recognition Speech Recognition
Code Code Available 0Analysis of French Phonetic Idiosyncrasies for Accent Recognition Oct 18, 2021 Multi-class Classification speech-recognition
Code Code Available 0Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream Weights Mar 14, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Onssen: an open-source speech separation and enhancement library Nov 3, 2019 Deep Clustering speech-recognition
Code Code Available 0End-to-End Attention-based Large Vocabulary Speech Recognition Aug 18, 2015 Acoustic Modelling Language Modeling
Code Code Available 0Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization Oct 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0A Design Methodology for Efficient Implementation of Deconvolutional Neural Networks on an FPGA May 7, 2017 CPU Denoising
Code Code Available 0Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching Apr 15, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0ASR Benchmarking: Need for a More Representative Conversational Dataset Sep 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks Dec 1, 2016 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Transformer-Based Named Entity Recognition for Automated Server Provisioning Apr 1, 2025 named-entity-recognition Named Entity Recognition
Code Code Available 0A Deep Relevance Matching Model for Ad-hoc Retrieval Nov 23, 2017 Ad-Hoc Information Retrieval Paraphrase Identification
Code Code Available 0Transformer Based Punctuation Restoration for Turkish Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible Jul 30, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition Nov 1, 2018 Data Augmentation Language Identification
Code Code Available 0BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition Apr 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0