| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity LinkingLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Dict-BERT: Enhancing Language Model Pre-training with Dictionary | Oct 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 | 5 |
| DIBERT: Dependency Injected Bidirectional Encoder Representations from Transformers | Dec 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 | 5 |