| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 | 5 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Deep Transformers with Latent Depth | Sep 28, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 0 | 5 |
| HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |