| Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings | Mar 30, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech | Nov 19, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing | Oct 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training | Oct 12, 2021 | Data AugmentationMulti-Task Learning | CodeCode Available | 1 |
| FastAudio: A Learnable Audio Front-End for Spoof Speech Detection | Sep 6, 2021 | Speaker IdentificationSpeaker Verification | CodeCode Available | 1 |
| Learning Audio-Visual Dereverberation | Jun 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding | Jun 3, 2021 | Conversational Response SelectionLanguage Modeling | CodeCode Available | 1 |
| Supervised Speech Representation Learning for Parkinson's Disease Classification | Jun 1, 2021 | ClassificationRepresentation Learning | CodeCode Available | 1 |
| Speech Resynthesis from Discrete Disentangled Self-Supervised Representations | Apr 1, 2021 | DisentanglementRepresentation Learning | CodeCode Available | 1 |
| Blind Speech Separation and Dereverberation using Neural Beamforming | Mar 24, 2021 | Speaker IdentificationSpeaker Separation | CodeCode Available | 1 |