Biometrics Recognition Using Deep Learning: A Survey Nov 30, 2019 Deep Learning Gait Recognition
Code Code Available 0Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0 Sep 27, 2022 Cultural Vocal Bursts Intensity Prediction Speech Recognition
Code Code Available 0Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks Oct 25, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Conditional independence for pretext task selection in Self-supervised speech representation learning Apr 15, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model Sep 6, 2021 speech-recognition Speech Recognition
Code Code Available 0Towards Unsupervised Speech Recognition Without Pronunciation Models Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems Nov 3, 2019 Adversarial Attack Speaker Recognition
Code Code Available 0Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation Jul 1, 2018 Domain Adaptation speech-recognition
Code Code Available 0Arabic Speech Recognition by End-to-End, Modular Systems and Human Jan 21, 2021 Arabic Speech Recognition Automatic Speech Recognition
Code Code Available 0LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training Dec 5, 2017 Federated Learning image-classification
Code Code Available 0Deep Gradient Compression Reduce the Communication Bandwidth For distributed Traning Dec 5, 2017 Federated Learning image-classification
Code Code Available 0VideoBERT: A Joint Model for Video and Language Representation Learning Apr 3, 2019 Action Classification General Classification
Code Code Available 0Use of Deep Learning in Modern Recommendation System: A Summary of Recent Works Dec 20, 2017 Deep Learning Information Retrieval
Code Code Available 0Trace norm regularization and faster inference for embedded speech recognition RNNs Oct 25, 2017 speech-recognition Speech Recognition
Code Code Available 0Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems Dec 1, 2021 speech-recognition Speech Recognition
Code Code Available 0Latent Tree Language Model Nov 1, 2016 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 0Improving RNN Transducer Modeling for End-to-End Speech Recognition Sep 26, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Adversarial Training For Low-Resource Disfluency Correction Jun 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Adversarial Example Detection by Classification for Deep Speech Recognition Oct 22, 2019 Classification General Classification
Code Code Available 0Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation Sep 13, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Pre-Finetuning for Few-Shot Emotional Speech Recognition Feb 24, 2023 Few-Shot Learning speech-recognition
Code Code Available 0Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks Jun 9, 2015 Constituency Parsing Image Captioning
Code Code Available 0Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain Jun 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech-enhanced and Noise-aware Networks for Robust Speech Recognition Mar 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations Mar 10, 2024 Automatic Speech Recognition Data Augmentation
Code Code Available 0Improving LSTM-CTC based ASR performance in domains with limited training data Jul 3, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Preparing Data from Psychotherapy for Natural Language Processing May 1, 2018 Speech Recognition
Code Code Available 0Preserving spoken content in voice anonymisation with character-level vocoder conditioning Aug 8, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0Improving CTC-based speech recognition via knowledge transferring from pre-trained language models Feb 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
Code Code Available 0Pretext Tasks selection for multitask self-supervised speech representation learning Jul 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Scribosermo: Fast Speech-to-Text models for German and other Languages Oct 15, 2021 Speech Recognition Speech-to-Text
Code Code Available 0Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings Jul 1, 2020 GPU speech-recognition
Code Code Available 0DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks Mar 8, 2023 Fault Detection speech-recognition
Code Code Available 0Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations Nov 14, 2022 Self-Supervised Learning speech-recognition
Code Code Available 0Deep-FSMN for Large Vocabulary Continuous Speech Recognition Mar 4, 2018 Language Modeling Language Modelling
Code Code Available 0ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction Oct 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0An End-to-End Neural Network for Polyphonic Piano Music Transcription Aug 7, 2015 Language Modeling Language Modelling
Code Code Available 0Trainable Frontend For Robust and Far-Field Keyword Spotting Jul 19, 2016 Keyword Spotting speech-recognition
Code Code Available 0Understanding the Tradeoffs in Client-side Privacy for Downstream Speech Tasks Jan 22, 2021 Representation Learning speech-recognition
Code Code Available 0Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions Feb 1, 2025 Lipreading speech-recognition
Code Code Available 0Learning Alignment for Multimodal Emotion Recognition from Speech Sep 6, 2019 Emotion Recognition Multimodal Emotion Recognition
Code Code Available 0Pre-training on high-resource speech recognition improves low-resource speech-to-text translation Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators Aug 30, 2019 image-classification Image Classification
Code Code Available 0Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding Feb 10, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improved training for online end-to-end speech recognition systems Nov 6, 2017 speech-recognition Speech Recognition
Code Code Available 0Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences Sep 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Vietnamese Capitalization and Punctuation Recovery Models Jul 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0