| Dynamic Masking Rate Schedules for MLM Pretraining | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer | Dec 15, 2024 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Causal Distillation for Language Models | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog | Nov 16, 2021 | dialog state trackingLanguage Modeling | —Unverified | 0 |
| Adversarial Soft Prompt Tuning for Cross-Domain Sentiment Analysis | May 1, 2022 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining | Jan 29, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives | Feb 25, 2021 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| LakotaBERT: A Transformer-based Model for Low Resource Lakota Language | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LAnoBERT: System Log Anomaly Detection based on BERT Masked Language Model | Nov 18, 2021 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Do Transformers Parse while Predicting the Masked Word? | Mar 14, 2023 | Constituency ParsingLanguage Modeling | —Unverified | 0 |
| Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling | Jan 25, 2024 | Causal Language ModelingDecoder | —Unverified | 0 |
| Capturing Topic Framing via Masked Language Modeling | Feb 7, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| Domain-Specific Japanese ELECTRA Model Using a Small Corpus | Sep 1, 2021 | ArticlesComputational Efficiency | —Unverified | 0 |
| APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning | Dec 19, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Domain-adapted large language models for classifying nuclear medicine reports | Mar 1, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge | Dec 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Nov 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adversarial Generation and Encoding of Nested Texts | Jun 1, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Pilot Study on Dialogue-Level Dependency Parsing for Chinese | May 21, 2023 | Dependency ParsingLanguage Modeling | —Unverified | 0 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Nov 16, 2021 | Few-Shot Text ClassificationLanguage Modeling | —Unverified | 0 |
| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| KUL@SMM4H’22: Template Augmented Adaptive Pre-training for Tweet Classification | Oct 1, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Discovering Financial Hypernyms by Prompting Masked Language Models | Jun 1, 2022 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| AntLM: Bridging Causal and Masked Language Models | Dec 4, 2024 | Causal Language ModelingDecoder | —Unverified | 0 |