| DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning | May 17, 2023 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| A multimodal dynamical variational autoencoder for audiovisual speech representation learning | May 5, 2023 | DenoisingDisentanglement | CodeCode Available | 0 |
| Learning Cross-lingual Visual Speech Representations | Mar 14, 2023 | Representation LearningSelf-Supervised Learning | —Unverified | 0 |
| FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning | Mar 9, 2023 | 3D Face AnimationRepresentation Learning | CodeCode Available | 1 |
| Self-supervised speech representation learning for keyword-spotting with light-weight transformers | Mar 7, 2023 | Keyword SpottingRepresentation Learning | —Unverified | 0 |
| Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding | Feb 27, 2023 | Model CompressionRepresentation Learning | CodeCode Available | 1 |
| A low latency attention module for streaming self-supervised speech representation learning | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Efficient Speech Representation Learning with Low-Bit Quantization | Dec 14, 2022 | Model CompressionQuantization | —Unverified | 0 |
| Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information | Dec 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Disentangled Feature Learning for Real-Time Neural Speech Coding | Nov 22, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 |