AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR Jun 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners May 22, 2022 Attribute Automatic Speech Recognition
Vietnamese Automatic Speech Recognition using Wav2vec 2.0 May 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment May 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Speaker Recognition in the Wild May 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages May 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Apr 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
PriMock57: A Dataset Of Primary Care Mock Consultations Apr 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings Mar 30, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Integrating Lattice-Free MMI into End-to-End Speech Recognition Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT Mar 29, 2022 All Automatic Speech Recognition
Earnings-22: A Practical Benchmark for Accents in the Wild Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition Mar 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition Mar 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors Mar 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
AISHELL-NER: Named Entity Recognition from Chinese Speech Feb 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition Feb 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Streaming Multi-Talker ASR with Token-Level Serialized Output Training Feb 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Unified Multimodal Punctuation Restoration Framework for Mixed-Modality Corpus Jan 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Jan 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement Dec 21, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
X-Vector based voice activity detection for multi-genre broadcast speech-to-text Dec 9, 2021 Action Detection Activity Detection
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech Nov 19, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
MT3: Multi-Task Multitrack Music Transcription Nov 4, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Cross Attention Augmented Transducer Networks for Simultaneous Translation Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
FAST-RIR: Fast neural diffuse room impulse response generator Oct 7, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Factorized Neural Transducer for Efficient Language Model Adaptation Sep 27, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition Sep 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Vietnamese end-to-end speech recognition using wav2vec 2.0 Sep 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition Aug 31, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification Aug 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)