| Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors | Mar 1, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| AISHELL-NER: Named Entity Recognition from Chinese Speech | Feb 17, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition | Feb 7, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Streaming Multi-Talker ASR with Token-Level Serialized Output Training | Feb 2, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Unified Multimodal Punctuation Restoration Framework for Mixed-Modality Corpus | Jan 24, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model | Jan 6, 2022 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement | Dec 21, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| X-Vector based voice activity detection for multi-genre broadcast speech-to-text | Dec 9, 2021 | Activity Detection | Code Available | 1 |
| Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI | Dec 5, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech | Nov 19, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| MT3: Multi-Task Multitrack Music Transcription | Nov 4, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Cross Attention Augmented Transducer Networks for Simultaneous Translation | Nov 1, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| A transfer learning based approach for pronunciation scoring | Nov 1, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing | Oct 14, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese | Oct 14, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications | Oct 12, 2021 | Activity Detection | Code Available | 1 |
| Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition | Oct 11, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables | Oct 11, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| FAST-RIR: Fast neural diffuse room impulse response generator | Oct 7, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Factorized Neural Transducer for Efficient Language Model Adaptation | Sep 27, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition | Sep 14, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Vietnamese end-to-end speech recognition using wav2vec 2.0 | Sep 2, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification | Aug 5, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English | Aug 3, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments | Jul 30, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| The History of Speech Recognition to the Year 2030 | Jul 30, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 | Jul 23, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Token-Level Supervised Contrastive Learning for Punctuation Restoration | Jul 19, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| STRODE: Stochastic Boundary Ordinary Differential Equation | Jul 17, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| A Comparison of Methods for OOV-word Recognition on a New Public Dataset | Jul 16, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Layer-wise Analysis of a Self-supervised Speech Representation Model | Jul 10, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| TENET: A Time-reversal Enhancement Network for Noise-robust ASR | Jul 4, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition | Jul 2, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition | Jul 1, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Learning Audio-Visual Dereverberation | Jun 14, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Incorporating External POS Tagger for Punctuation Restoration | Jun 12, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Lightweight Adapter Tuning for Multilingual Speech Translation | Jun 2, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights | Jun 2, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation | May 11, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| End-to-End Speech Recognition from Federated Acoustic Models | Apr 29, 2021 | 2k4k | Code Available | 1 |
| LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech | Apr 23, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| A Toolbox for Construction and Analysis of Speech Datasets | Apr 11, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| RNN Transducer Models For Spoken Language Understanding | Apr 8, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs | Apr 7, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi | Apr 3, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Integer-only Zero-shot Quantization for Efficient Speech Recognition | Mar 31, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages | Mar 26, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Radically Old Way of Computing Spectra: Applications in End-to-End ASR | Mar 25, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |