SOTAVerified

GSM8K

Papers

Showing 110 of 439 papers

TitleStatusHype
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All ToolsCode14
Qwen2 Technical ReportCode13
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningCode9
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt CompressionCode9
Qwen2.5-Omni Technical ReportCode7
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM TrainingCode7
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsCode6
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8BCode5
Common 7B Language Models Already Possess Strong Math CapabilitiesCode5
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language ModelsCode5
Show:102550
← PrevPage 1 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified