| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge | Dec 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Joint Unsupervised and Supervised Training for Multilingual ASR | Nov 15, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering | May 6, 2022 | Contrastive LearningExtractive Question-Answering | —Unverified | 0 |
| Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search | Dec 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models | Mar 27, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little | Apr 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |