SOTAVerified

Continual Pretraining

Papers

Showing 2650 of 70 papers

TitleStatusHype
ChuXin: 1.6B Technical Report0
Continual Learning for Large Language Models: A Survey0
70B-parameter large language models in Japanese medical question-answering0
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective0
Cross-sensor self-supervised training and alignment for remote sensing0
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code0
DD-TIG at Constraint@ACL2022: Multimodal Understanding and Reasoning for Role Labeling of Entities in Hateful Memes0
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining0
Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language0
Enhance Mobile Agents Thinking Process Via Iterative Preference Learning0
On the Robustness of Reading Comprehension Models to Entity Renaming0
Open Generative Large Language Models for Galician0
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling0
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?0
Pretraining and Updates of Domain-Specific LLM: A Case Study in the Japanese Business Domain0
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models0
Revisiting Pretraining with Adapters0
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text0
The Construction of Instruction-tuned LLMs for Finance without Instruction Data Using Continual Pretraining and Model Merging0
AdaPrompt: Adaptive Model Training for Prompt-based NLP0
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM0
Bilingual Adaptation of Monolingual Foundation Models0
Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content0
Investigating Continual Pretraining in Large Language Models: Insights and Implications0
Is Domain Adaptation Worth Your Investment? Comparing BERT and FinBERT on Financial Tasks0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DASF1 (macro)0.69Unverified
#ModelMetricClaimedVerifiedStatus
1CPTF1 - macro63.77Unverified
#ModelMetricClaimedVerifiedStatus
1DASF1 (macro)0.71Unverified