SOTAVerified

Continual Pretraining

Papers

Showing 2650 of 70 papers

TitleStatusHype
Bilingual Adaptation of Monolingual Foundation Models0
70B-parameter large language models in Japanese medical question-answering0
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective0
Open Generative Large Language Models for Galician0
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM0
Towards Lifelong Learning of Large Language Models: A SurveyCode2
LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models0
Multi-Label Guided Soft Contrastive Learning for Efficient Earth Observation PretrainingCode1
MoRA: High-Rank Updating for Parameter-Efficient Fine-TuningCode3
Cross-sensor self-supervised training and alignment for remote sensing0
ChuXin: 1.6B Technical Report0
Retrieval Head Mechanistically Explains Long-Context FactualityCode3
Pretraining and Updates of Domain-Specific LLM: A Case Study in the Japanese Business Domain0
CEM: A Data-Efficient Method for Large Language Models to Continue Evolving From Mistakes0
Rho-1: Not All Tokens Are What You NeedCode3
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code0
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?0
Yi: Open Foundation Models by 01.AICode9
Investigating Continual Pretraining in Large Language Models: Insights and Implications0
Data Engineering for Scaling Language Models to 128K ContextCode3
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical TextsCode2
Continual Learning for Large Language Models: A Survey0
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via RomanizationCode0
PECoP: Parameter Efficient Continual Pretraining for Action Quality AssessmentCode0
Effective Long-Context Scaling of Foundation ModelsCode2
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DASF1 (macro)0.69Unverified
#ModelMetricClaimedVerifiedStatus
1CPTF1 - macro63.77Unverified
#ModelMetricClaimedVerifiedStatus
1DASF1 (macro)0.71Unverified