SOTAVerified

GSM8K

Papers

Showing 281-290 of 439 papers

| Title | Status | Hype |
| --- | --- | --- |
| Evolutionary Pre-Prompt Optimization for Mathematical Reasoning | | 0 |
| Premise Order Matters in Reasoning with Large Language Models | | 0 |
| PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models | | 0 |
| Evaluation of LLMs for mathematical problem solving | | 0 |
| Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation | | 0 |
| Prompt Baking | | 0 |
| Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search | | 0 |
| Prompt Engineering a Prompt Engineer | | 0 |
| Prompt-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression | | 0 |
| Prompt Selection and Augmentation for Few Examples Code Generation in Large Language Model and its Application in Robotics Control | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Xolver | Accuracy | 98.1 | | Unverified |
| 2 | Orange-mini | 0-shot MRR | 98 | | Unverified |