SOTAVerified

Continual Pretraining

Papers

Showing 21-30 of 70 papers

Title | Status | Hype
LangSAMP: Language-Script Aware Multilingual Pretraining | Code | 0
Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach | Code | 0
A Practitioner's Guide to Continual Multimodal Pretraining | Code | 2
RedWhale: An Adapted Korean LLM Through Efficient Continual Pretraining | | 0
Scaling Granite Code Models to 128K Context | Code | 4
Bilingual Adaptation of Monolingual Foundation Models | | 0
70B-parameter large language models in Japanese medical question-answering | | 0
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective | | 0
Open Generative Large Language Models for Galician | | 0
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | DAS | F1 (macro) | 0.69 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPT | F1 - macro | 63.77 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DAS | F1 (macro) | 0.71 | | Unverified