SOTAVerified

Multiple-choice

Papers

Showing 171180 of 1107 papers

TitleStatusHype
Marathon: A Race Through the Realm of Long Context with Large Language ModelsCode1
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and LayersCode1
Fake Alignment: Are LLMs Really Aligned Well?Code1
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language ModelsCode1
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysisCode1
An Open Source Data Contamination Report for Large Language ModelsCode1
JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuningCode1
OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language ModelsCode1
BRAINTEASER: Lateral Thinking Puzzles for Large Language ModelsCode1
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language ModelsCode1
Show:102550
← PrevPage 18 of 111Next →

No leaderboard results yet.