SOTAVerified

Mathematical Problem-Solving

Papers

Showing 11-20 of 106 papers

Title | Status | Hype
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Code | 2
Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures | Code | 2
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data | Code | 2
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models | Code | 2
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving | Code | 2
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline | Code | 2
Measuring Mathematical Problem Solving With the MATH Dataset | Code | 2
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning | Code | 1
Solving Inequality Proofs with Large Language Models | Code | 1
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning | Code | 1

No leaderboard results yet.