| Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective | Jun 19, 2024 | BenchmarkingContinual Pretraining | —Unverified | 0 | 0 |
| Cross-sensor self-supervised training and alignment for remote sensing | May 16, 2024 | Continual PretrainingEarth Observation | —Unverified | 0 | 0 |
| Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code | Mar 30, 2024 | Continual PretrainingLanguage Modelling | —Unverified | 0 | 0 |
| DD-TIG at Constraint@ACL2022: Multimodal Understanding and Reasoning for Role Labeling of Entities in Hateful Memes | May 1, 2022 | Continual PretrainingData Augmentation | —Unverified | 0 | 0 |
| DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining | Sep 30, 2024 | Continual PretrainingDomain Adaptation | —Unverified | 0 | 0 |
| Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language | Apr 28, 2025 | Continual PretrainingGPU | —Unverified | 0 | 0 |
| Enhance Mobile Agents Thinking Process Via Iterative Preference Learning | May 18, 2025 | Continual Pretraining | —Unverified | 0 | 0 |
| Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them | Mar 27, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 | 0 |
| Investigating Continual Pretraining in Large Language Models: Insights and Implications | Feb 27, 2024 | Continual LearningContinual Pretraining | —Unverified | 0 | 0 |
| Is Domain Adaptation Worth Your Investment? Comparing BERT and FinBERT on Financial Tasks | Nov 1, 2021 | Continual PretrainingDomain Adaptation | —Unverified | 0 | 0 |
| CEM: A Data-Efficient Method for Large Language Models to Continue Evolving From Mistakes | Apr 11, 2024 | Continual LearningContinual Pretraining | —Unverified | 0 | 0 |
| Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora | Oct 16, 2021 | Continual LearningContinual Pretraining | —Unverified | 0 | 0 |
| LLaVA-c: Continual Improved Visual Instruction Tuning | Jun 10, 2025 | Continual LearningContinual Pretraining | —Unverified | 0 | 0 |
| LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models | Jun 2, 2024 | Continual PretrainingInformation Retrieval | —Unverified | 0 | 0 |
| Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning | May 15, 2025 | Continual PretrainingMMLU | —Unverified | 0 | 0 |
| AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy | Sep 29, 2024 | AstronomyBenchmarking | —Unverified | 0 | 0 |
| Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study | Feb 4, 2025 | Continual PretrainingMachine Translation | —Unverified | 0 | 0 |
| On the Robustness of Reading Comprehension Models to Entity Renaming | Nov 16, 2021 | Continual PretrainingMachine Reading Comprehension | —Unverified | 0 | 0 |
| Open Generative Large Language Models for Galician | Jun 19, 2024 | Continual PretrainingDiversity | —Unverified | 0 | 0 |
| Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | Mar 24, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 | 0 |