SOTAVerified

Math

Papers

Showing 251–275 of 1596 papers

Title | Status | Hype
Broken Neural Scaling Laws | Code | 1
Control LLM: Controlled Evolution for Intelligence Retention in LLM | Code | 1
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Code | 1
MathPrompter: Mathematical Reasoning using Large Language Models | Code | 1
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks | Code | 1
Brilla AI: AI Contestant for the National Science and Maths Quiz | Code | 1
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Code | 1
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs | Code | 1
Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems | Code | 1
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization | Code | 1
LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks | Code | 1
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models | Code | 1
Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations | Code | 1
LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models | Code | 1
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | Code | 1
Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching | Code | 1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Code | 1
MATHWELL: Generating Educational Math Word Problems Using Teacher Annotations | Code | 1
Learning Multi-Step Reasoning by Solving Arithmetic Tasks | Code | 1
Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction | Code | 1
Let's Verify Math Questions Step by Step | Code | 1
BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing | Code | 1
An In-depth Look at Gemini's Language Abilities | Code | 1
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions | Code | 1
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs | Code | 1

No leaderboard results yet.