| MelHuBERT: A simplified HuBERT on Mel spectrograms | Nov 17, 2022 | Automatic Speech RecognitionSelf-Supervised Learning | CodeCode Available | 1 |
| LongFNT: Long-form Speech Recognition with Factorized Neural Transducer | Nov 17, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition | Nov 17, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches | Nov 16, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer | Nov 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Introducing Semantics into Speech Encoders | Nov 15, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards A Unified Conformer Structure: from ASR to ASV Task | Nov 14, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets | Nov 14, 2022 | Automatic Speech RecognitionMulti-Task Learning | CodeCode Available | 1 |
| Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts | Nov 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation | Nov 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |