Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit Dec 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information Dec 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers Dec 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Progressive Multi-Scale Self-Supervised Learning for Speech Recognition Dec 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Robust Speech Recognition via Large-Scale Weak Supervision Dec 6, 2022 Robust Speech Recognition speech-recognition
Code Code Available 8SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels Dec 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition Dec 5, 2022 Position speech-recognition
Code Code Available 0Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models Dec 5, 2022 Language Modeling Language Modelling
— Unverified 0Unsupervised Fine-Tuning Data Selection for ASR Using Self-Supervised Speech Models Dec 3, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Continual Learning for On-Device Speech Recognition using Disentangled Conformers Dec 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fuse and Adapt: Investigating the Use of Pre-Trained Self-Supervising Learning Models in Limited Data NLU problems Dec 2, 2022 Domain Adaptation Emotion Recognition
— Unverified 0Cross-Modal Mutual Learning for Cued Speech Recognition Dec 2, 2022 speech-recognition Speech Recognition
— Unverified 0SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition Dec 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Surrogate Gradient Spiking Neural Networks as Encoders for Large Vocabulary Continuous Speech Recognition Dec 1, 2022 speech-recognition Speech Recognition
— Unverified 0Gated Recurrent Neural Networks with Weighted Time-Delay Feedback Dec 1, 2022 Activity Recognition Human Activity Recognition
— Unverified 0Preliminary Study on SSCF-derived Polar Coordinate for ASR Nov 30, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0EURO: ESPnet Unsupervised ASR Open-source Toolkit Nov 30, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing Nov 30, 2022 Machine Translation Sentence
— Unverified 0MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition Nov 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems Nov 29, 2022 speech-recognition Speech Recognition
Code Code Available 1Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation Nov 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Better Transcription of UK Supreme Court Hearings Nov 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluating and reducing the distance between synthetic and real speech distributions Nov 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition Nov 28, 2022 named-entity-recognition Named Entity Recognition
— Unverified 0Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition Nov 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bidirectional Representations for Low Resource Spoken Language Understanding Nov 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multitask Learning for Low Resource Spoken Language Understanding Nov 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Multi-task Learning via Seeking Task-based Flat Regions Nov 24, 2022 Multi-Task Learning speech-recognition
— Unverified 0Whose Emotion Matters? Speaking Activity Localisation without Prior Knowledge Nov 23, 2022 Active Speaker Detection Automatic Speech Recognition
Code Code Available 0Device Directedness with Contextual Cues for Spoken Dialog Systems Nov 23, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction Nov 23, 2022 Decoder Sentence
— Unverified 0Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition Nov 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation Nov 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SSCFormer: Push the Limit of Chunk-wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution Nov 21, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards continually learning new languages Nov 21, 2022 All speech-recognition
— Unverified 0SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale Nov 21, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning Nov 21, 2022 Audio-Visual Speech Recognition Language Modelling
— Unverified 0Constructing Effective Machine Learning Models for the Sciences: A Multidisciplinary Perspective Nov 21, 2022 regression speech-recognition
— Unverified 0Exploring WavLM on Speech Enhancement Nov 18, 2022 Self-Supervised Learning Speech Enhancement
— Unverified 0A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora Nov 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Hey ASR System! Why Aren't You More Inclusive? Automatic Speech Recognition Systems' Bias and Proposed Bias Mitigation Techniques. A Literature Review Nov 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition Nov 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MelHuBERT: A simplified HuBERT on Mel spectrograms Nov 17, 2022 Automatic Speech Recognition Self-Supervised Learning
Code Code Available 1LongFNT: Long-form Speech Recognition with Factorized Neural Transducer Nov 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0L2 proficiency assessment using self-supervised speech representations Nov 16, 2022 speech-recognition Speech Recognition
— Unverified 0Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer Nov 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches Nov 16, 2022 Action Detection Activity Detection
— Unverified 0Streaming Joint Speech Recognition and Disfluency Detection Nov 16, 2022 Decoder Language Modelling
Code Code Available 0Introducing Semantics into Speech Encoders Nov 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder Nov 15, 2022 Contrastive Learning Disentanglement
— Unverified 0