| Cascaded Head-colliding Attention | May 31, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Learning Hierarchical Structures with Differentiable Nondeterministic Stacks | Sep 5, 2021 | Inductive Bias, Language Modeling | Code Available | 1 | 5 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignment, Knowledge Distillation | Code Available | 1 | 5 |
| Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models | Mar 2, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Federated Learning for ASR based on Wav2vec 2.0 | Feb 20, 2023 | Federated Learning, Language Modeling | Code Available | 1 | 5 |
| Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring | Apr 1, 2022 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image Captioning, Language Modeling | Code Available | 1 | 5 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |