Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation Apr 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition Apr 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking Apr 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automated speech tools for helping communities process restricted-access corpora for language revival efforts Apr 15, 2022 Action Detection Activity Detection
— Unverified 0Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features Apr 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes Apr 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Study of Indian English Pronunciation Variabilities relative to Received Pronunciation Apr 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-critical Sequence Training for Automatic Speech Recognition Apr 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition Apr 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction Apr 12, 2022 speech-recognition Speech Recognition
— Unverified 0ASR in German: A Detailed Error Analysis Apr 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding Apr 11, 2022 Decoder Lipreading
— Unverified 0Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Apr 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Unified Speech-Text Pre-training for Speech Translation and Recognition Apr 11, 2022 Decoder speech-recognition
— Unverified 0Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data Apr 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Deep Embeddings for Robust User-Based Amateur Vocal Percussion Classification Apr 10, 2022 Classification feature selection
— Unverified 0Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction Apr 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition Apr 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners Apr 8, 2022 Prediction Speech Enhancement
Code Code Available 0Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition Apr 8, 2022 Decoder speech-recognition
— Unverified 0Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser Apr 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition Apr 8, 2022 speech-recognition Speech Recognition
Code Code Available 0Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition Apr 8, 2022 Action Detection Activity Detection
— Unverified 0Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0 Apr 7, 2022 Multi-Task Learning speech-recognition
— Unverified 0Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model Apr 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MAESTRO: Matched Speech Text Representations through Modality Matching Apr 7, 2022 Language Modelling Self-Supervised Learning
— Unverified 03M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition Apr 7, 2022 Mixture-of-Experts speech-recognition
Code Code Available 1Enabling All In-Edge Deep Learning: A Literature Review Apr 7, 2022 All Deep Learning
— Unverified 0A Wav2vec2-Based Experimental Study on Self-Supervised Learning Methods to Improve Child Speech Recognition Apr 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Successes and critical failures of neural networks in capturing human-like speech recognition Apr 6, 2022 speech-recognition Speech Recognition
— Unverified 0Emotional Speech Recognition with Pre-trained Deep Visual Models Apr 6, 2022 Emotion Recognition speech-recognition
Code Code Available 0Simple and Effective Unsupervised Speech Synthesis Apr 6, 2022 speech-recognition Speech Recognition
— Unverified 0A survey on recently proposed activation functions for Deep Learning Apr 6, 2022 Deep Learning speech-recognition
— Unverified 0Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation Apr 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition Apr 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Data Selection via Discrete Speech Representation for ASR Apr 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective Apr 5, 2022 Disentanglement Representation Learning
— Unverified 0Audio-visual multi-channel speech separation, dereverberation and recognition Apr 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation Apr 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards End-to-end Unsupervised Speech Recognition Apr 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Low-Latency Speech Separation Guided Diarization for Telephone Conversations Apr 5, 2022 Action Detection Activity Detection
Code Code Available 1Hear No Evil: Towards Adversarial Robustness of Automatic Speech Recognition via Multi-Task Learning Apr 5, 2022 Adversarial Attack Adversarial Robustness
— Unverified 0Deliberation Model for On-Device Spoken Language Understanding Apr 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Analysis of Semantically-Aligned Speech-Text Embeddings Apr 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition Apr 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems Apr 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices Apr 4, 2022 Speaker Verification speech-recognition
— Unverified 0Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents Apr 3, 2022 speech-recognition Speech Recognition
— Unverified 0Speaker adaptation for Wav2vec2 based dysarthric ASR Apr 2, 2022 speech-recognition Speech Recognition
— Unverified 0Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation Apr 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0