SOTAVerified

Continual Pretraining

Papers

Showing 1-25 of 70 papers

| Title | Status | Hype |
| --- | --- | --- |
| Yi: Open Foundation Models by 01.AI | Code | 9 |
| Scaling Granite Code Models to 128K Context | Code | 4 |
| Retrieval Head Mechanistically Explains Long-Context Factuality | Code | 3 |
| Data Engineering for Scaling Language Models to 128K Context | Code | 3 |
| Rho-1: Not All Tokens Are What You Need | Code | 3 |
| MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning | Code | 3 |
| Towards Lifelong Learning of Large Language Models: A Survey | Code | 2 |
| A Practitioner's Guide to Continual Multimodal Pretraining | Code | 2 |
| Continual Training of Language Models for Few-Shot Learning | Code | 2 |
| Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Code | 2 |
| Continual Pre-training of Language Models | Code | 2 |
| Effective Long-Context Scaling of Foundation Models | Code | 2 |
| TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining | Code | 1 |
| ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning | Code | 1 |
| Multi-Label Guided Soft Contrastive Learning for Efficient Earth Observation Pretraining | Code | 1 |
| Towards Geospatial Foundation Models via Continual Pretraining | Code | 1 |
| NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis | Code | 1 |
| Continual Pre-Training Mitigates Forgetting in Language and Vision | Code | 1 |
| Demystifying Domain-adaptive Post-training for Financial LLMs | Code | 1 |
| CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation | Code | 1 |
| CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation | Code | 1 |
| Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning | Code | 1 |
| On the Robustness of Reading Comprehension Models to Entity Renaming | Code | 1 |
| ChuXin: 1.6B Technical Report | | 0 |
| AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy | | 0 |

Benchmark Results

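| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DAS | F1 (macro) | 0.69 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CPT | F1 - macro | 63.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DAS | F1 (macro) | 0.71 | | Unverified |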