SOTAVerified

GSM8K

Papers

Showing 311320 of 439 papers

TitleStatusHype
Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained RewardsCode2
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language ModelsCode3
Automatic Prompt Selection for Large Language Models0
Prompt-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression0
Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with AutoformalizationCode1
Supervisory Prompt Training0
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningCode9
LLM2LLM: Boosting LLMs with Novel Iterative Data EnhancementCode2
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt CompressionCode9
Self-Consistency Boosts Calibration for Math Reasoning0
Show:102550
← PrevPage 32 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified