| The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge | Feb 4, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| Joint Speech Recognition and Audio Captioning | Feb 3, 2022 | AudioCapsAudio captioning | —Unverified | 0 |
| The RoyalFlush System of Speech Recognition for M2MeT Challenge | Feb 3, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Streaming Multi-Talker ASR with Token-Level Serialized Output Training | Feb 2, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| ASR-Aware End-to-end Neural Diarization | Feb 2, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Error Correction in ASR using Sequence-to-Sequence Models | Feb 2, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| RescoreBERT: Discriminative Speech Recognition Rescoring with BERT | Feb 2, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Language Dependencies in Adversarial Attacks on Speech Recognition Systems | Feb 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | Feb 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Visualizing Automatic Speech Recognition -- Means for a Better Understanding? | Feb 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Reducing language context confusion for end-to-end code-switching automatic speech recognition | Jan 28, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Star Temporal Classification: Sequence Classification with Partially Labeled Data | Jan 28, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition | Jan 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition | Jan 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition | Jan 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR | Jan 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The Norwegian Parliamentary Speech Corpus | Jan 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving the fusion of acoustic and text representations in RNN-T | Jan 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models | Jan 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR | Jan 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | Jan 25, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Unified Multimodal Punctuation Restoration Framework for Mixed-Modality Corpus | Jan 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition | Jan 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR | Jan 18, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Human and Automatic Speech Recognition Performance on German Oral History Interviews | Jan 18, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning | Jan 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| RED-ACE: Robust Error Detection for ASR using Confidence Embeddings | Jan 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Recent Progress in the CUHK Dysarthric Speech Recognition System | Jan 15, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition | Jan 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Likelihood Ratio based Domain Adaptation Method for E2E Models | Jan 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection | Jan 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks | Jan 8, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset | Jan 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model | Jan 6, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Robust Self-Supervised Audio-Visual Speech Recognition | Jan 5, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 2 |
| Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction | Jan 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question | Jan 4, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation | Jan 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Dialect Arabic Speech Recognition | Dec 25, 2021 | Arabic Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition | Dec 23, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement | Dec 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Voice Quality and Pitch Features in Transformer-Based Speech Recognition | Dec 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching | Dec 19, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-turn RNN-T for streaming recognition of multi-party speech | Dec 19, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Continual Learning for Monolingual End-to-End Automatic Speech Recognition | Dec 17, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems | Dec 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Real-Time Neural Voice Camouflage | Dec 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model | Dec 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Robustifying automatic speech recognition by extracting slowly varying features | Dec 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition | Dec 13, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |