SOTAVerified

MMLU

Papers

Showing 191-200 of 340 papers

Title | Status | Hype
BrainTransformers: SNN-LLM |  | 0
Efficiently Deploying LLMs with Controlled Risk |  | 0
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs |  | 0
SSR: Alignment-Aware Modality Connector for Speech Language Models |  | 0
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining |  | 0
Instance-adaptive Zero-shot Chain-of-Thought Prompting |  | 0
Uncovering Latent Chain of Thought Vectors in Language Models |  | 0
Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination |  | 0
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning | Code | 1
GRIN: GRadient-INformed MoE |  | 0
Page 20 of 34

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | go ahead, make my data | Final_score | 61.72 |  | Unverified
2 | #GreedyCow | Final_score | 61.63 |  | Unverified
3 | Don't Ask Us y | Final_score | 61.4 |  | Unverified
4 | Data_and_Confused | Final_score | 60.96 |  | Unverified
5 | Waffles | Final_score | 60.91 |  | Unverified
6 | raaka | Final_score | 60.91 |  | Unverified
7 | Team Procrustination | Final_score | 60.64 |  | Unverified
8 | Axiom Consulting Partners | Final_score | 60.63 |  | Unverified
9 | Lets_Be_Fair | Final_score | 60.23 |  | Unverified
10 | gooners | Final_score | 60.22 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Orange-mini | 0-shot MRR | 99.19 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | HybridBeam+ | SI-SDRi | 13.3 |  | Unverified