| Enhancing Crisis-Related Tweet Classification with Entity-Masked Language Modeling and Multi-Task Learning | Nov 21, 2022 | Hierarchical Multi-label ClassificationLanguage Modeling | CodeCode Available | 0 |
| DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog | May 1, 2022 | dialog state trackingLanguage Modeling | CodeCode Available | 0 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state trackingLanguage Modeling | CodeCode Available | 0 |
| Distributionally robust self-supervised learning for tabular data | Oct 11, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Seventeenth-Century Spanish American Notary Records for Fine-Tuning Spanish Large Language Models | Jun 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies | Oct 30, 2024 | Language AcquisitionMasked Language Modeling | CodeCode Available | 0 |
| Distilling Knowledge Learned in BERT for Text Generation | Nov 10, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Learning Better Masking for Better Language Model Pre-training | Aug 23, 2022 | DenoisingLanguage Modeling | CodeCode Available | 0 |