SOTAVerified

Continual Pretraining

Papers

Showing 125 of 70 papers

TitleStatusHype
Yi: Open Foundation Models by 01.AICode9
Scaling Granite Code Models to 128K ContextCode4
Rho-1: Not All Tokens Are What You NeedCode3
Data Engineering for Scaling Language Models to 128K ContextCode3
MoRA: High-Rank Updating for Parameter-Efficient Fine-TuningCode3
Retrieval Head Mechanistically Explains Long-Context FactualityCode3
Towards Lifelong Learning of Large Language Models: A SurveyCode2
Effective Long-Context Scaling of Foundation ModelsCode2
Continual Training of Language Models for Few-Shot LearningCode2
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical TextsCode2
Continual Pre-training of Language ModelsCode2
A Practitioner's Guide to Continual Multimodal PretrainingCode2
Multi-Label Guided Soft Contrastive Learning for Efficient Earth Observation PretrainingCode1
NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision AnalysisCode1
On the Robustness of Reading Comprehension Models to Entity RenamingCode1
ECONET: Effective Continual Pretraining of Language Models for Event Temporal ReasoningCode1
Efficient Contrastive Learning via Novel Data Augmentation and Curriculum LearningCode1
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology PreservationCode1
Continual Pre-Training Mitigates Forgetting in Language and VisionCode1
Demystifying Domain-adaptive Post-training for Financial LLMsCode1
CTP:Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology PreservationCode1
Towards Geospatial Foundation Models via Continual PretrainingCode1
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM PretrainingCode1
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language ModelCode0
LangSAMP: Language-Script Aware Multilingual PretrainingCode0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DASF1 (macro)0.69Unverified
#ModelMetricClaimedVerifiedStatus
1CPTF1 - macro63.77Unverified
#ModelMetricClaimedVerifiedStatus
1DASF1 (macro)0.71Unverified