| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning | Apr 27, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |
| Emerging Property of Masked Token for Effective Pre-training | Apr 12, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| OPSD: an Offensive Persian Social media Dataset and its baseline evaluations | Apr 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Effectively Prompting Small-sized Language Models for Cross-lingual Tasks via Winning Tickets | Apr 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining | Apr 1, 2024 | Contrastive LearningImage-text matching | —Unverified | 0 |
| Developing Healthcare Language Model Embedding Spaces | Mar 28, 2024 | Contrastive LearningDocument Classification | —Unverified | 0 |
| Fingerprinting web servers through Transformer-encoded HTTP response headers | Mar 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Detecting Bias in Large Language Models: Fine-tuned KcBERT | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Merging Text Transformer Models from Different Initializations | Mar 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |