SOTAVerified

GSM8K

Papers

Showing 101110 of 439 papers

TitleStatusHype
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical ReasoningCode1
Efficient Reasoning for LLMs through Speculative Chain-of-ThoughtCode1
Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with AutoformalizationCode1
Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt TemplatesCode1
IRanker: Towards Ranking Foundation ModelCode1
Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning TasksCode1
Matrix Information Theory for Self-Supervised LearningCode1
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning ProofsCode1
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator AgentCode1
MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought ThinkingCode1
Show:102550
← PrevPage 11 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified