| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph Learning, Language Modeling | Code Available | 0 |
| MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection | Sep 16, 2021 | Attribute, Language Modeling | Code Available | 0 |
| Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates | May 1, 2022 | Cross-Lingual Natural Language Inference, Cross-Lingual Transfer | Code Available | 0 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | Decoder, Language Modeling | Code Available | 0 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language Modeling | Code Available | 0 |
| Enhancing Crisis-Related Tweet Classification with Entity-Masked Language Modeling and Multi-Task Learning | Nov 21, 2022 | Hierarchical Multi-label Classification, Language Modeling | Code Available | 0 |
| DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog | May 1, 2022 | dialog state tracking, Language Modeling | Code Available | 0 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event Detection, Language Modeling | Code Available | 0 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state tracking, Language Modeling | Code Available | 0 |
| Distributionally robust self-supervised learning for tabular data | Oct 11, 2024 | Decoder, Language Modeling | Code Available | 0 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | Decoder, Language Modeling | Code Available | 0 |
| Seventeenth-Century Spanish American Notary Records for Fine-Tuning Spanish Large Language Models | Jun 9, 2024 | Language Modeling | Code Available | 0 |
| Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies | Oct 30, 2024 | Language Acquisition, Masked Language Modeling | Code Available | 0 |
| Distilling Knowledge Learned in BERT for Text Generation | Nov 10, 2019 | Language Modeling | Code Available | 0 |
| Learning Better Masking for Better Language Model Pre-training | Aug 23, 2022 | Denoising, Language Modeling | Code Available | 0 |
| Unlocking Efficiency: Adaptive Masking for Gene Transformer Models | Aug 13, 2024 | Language Modeling | Code Available | 0 |
| LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking | Apr 18, 2022 | cross-modal alignment, Document AI | Code Available | 0 |
| Latent State Models of Training Dynamics | Aug 18, 2023 | Image Classification | Code Available | 0 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language Modeling | Code Available | 0 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language Modeling | Code Available | 0 |
| SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification | Feb 1, 2021 | Language Identification, Language Modeling | Code Available | 0 |
| Multilingual Normalization of Temporal Expressions with Masked Language Models | May 20, 2022 | Language Modeling | Code Available | 0 |
| Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages | Jul 14, 2022 | Language Modeling | Code Available | 0 |
| Towards Unifying Reference Expression Generation and Comprehension | Oct 24, 2022 | Language Modeling | Code Available | 0 |
| Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic | May 20, 2024 | Language Modeling | Code Available | 0 |