Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Apr 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition Apr 7, 2022 Mixture-of-Experts speech-recognition
Low-Latency Speech Separation Guided Diarization for Telephone Conversations Apr 5, 2022 Action Detection Activity Detection
PriMock57: A Dataset Of Primary Care Mock Consultations Apr 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
HiFi-VC: High Quality ASR-Based Voice Conversion Mar 31, 2022 speech-recognition Speech Recognition
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings Mar 30, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Recent improvements of ASR models in the face of adversarial attacks Mar 29, 2022 speech-recognition Speech Recognition
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Earnings-22: A Practical Benchmark for Accents in the Wild Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Integrating Lattice-Free MMI into End-to-End Speech Recognition Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT Mar 29, 2022 All Automatic Speech Recognition
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition Mar 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning Simulations Mar 25, 2022 Federated Learning Quantization
Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing Mar 18, 2022 Representation Learning Speaker Verification
Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition Mar 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors Mar 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
AISHELL-NER: Named Entity Recognition from Chinese Speech Feb 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language Feb 7, 2022 image-classification Image Classification
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition Feb 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Self-supervised Learning with Random-projection Quantizer for Speech Recognition Feb 3, 2022 Self-Supervised Learning speech-recognition
Streaming Multi-Talker ASR with Token-Level Serialized Output Training Feb 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection Jan 30, 2022 speech-recognition Speech Recognition
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition Jan 11, 2022 Audio-Visual Speech Recognition speech-recognition
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Jan 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Discriminative Hierarchical PLDA-based Model for Spoken Language Recognition Jan 4, 2022 Machine Translation speech-recognition
Adversarial Attacks against Windows PE Malware Detection: A Survey of the State-of-the-Art Dec 23, 2021 Adversarial Attack Malware Detection
Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement Dec 21, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Self-Supervised Learning for speech recognition with Intermediate layer supervision Dec 16, 2021 Language Modeling Language Modelling
X-Vector based voice activity detection for multi-genre broadcast speech-to-text Dec 9, 2021 Action Detection Activity Detection
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Romanian Speech Recognition Experiments from the ROBIN Project Nov 23, 2021 Language Modelling speech-recognition
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech Nov 19, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Nov 17, 2021 Language Identification Representation Learning
MT3: Multi-Task Multitrack Music Transcription Nov 4, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Cross Attention Augmented Transducer Networks for Simultaneous Translation Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Efficiently Modeling Long Sequences with Structured State Spaces Oct 31, 2021 Data Augmentation Language Modeling
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model Oct 31, 2021 Decoder speech-recognition
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection