SOTAVerified

Math

Papers

Showing 1-25 of 1596 papers

Title | Status | Hype
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Code | 14
Qwen2.5 Technical Report | Code | 13
Qwen2.5-Coder Technical Report | Code | 11
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model | Code | 9
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models | Code | 9
AgentRxiv: Towards Collaborative Autonomous Research | Code | 9
s1: Simple test-time scaling | Code | 9
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Code | 9
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback | Code | 7
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines | Code | 7
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild | Code | 7
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning | Code | 7
EvoAgentX: An Automated Framework for Evolving Agentic Workflows | Code | 7
O1 Replication Journey: A Strategic Progress Report -- Part 1 | Code | 7
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference | Code | 7
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models | Code | 7
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning | Code | 7
OpenThoughts: Data Recipes for Reasoning Models | Code | 7
Kimi k1.5: Scaling Reinforcement Learning with LLMs | Code | 7
StarCoder 2 and The Stack v2: The Next Generation | Code | 7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! | Code | 7
S*: Test Time Scaling for Code Generation | Code | 7
TTRL: Test-Time Reinforcement Learning | Code | 7
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking | Code | 7
Mistral 7B | Code | 6

No leaderboard results yet.