SOTAVerified

Math

Papers

Showing 376-400 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| A Symbolic Character-Aware Model for Solving Geometry Problems | Code | 1 |
| Non-myopic Generation of Language Models for Reasoning and Planning | Code | 1 |
| ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges | Code | 1 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Code | 1 |
| Design and implementation of an environment for Learning to Run a Power Network (L2RPN) | Code | 1 |
| Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | Code | 1 |
| FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains | Code | 1 |
| JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding | Code | 1 |
| CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning | Code | 1 |
| Collective Constitutional AI: Aligning a Language Model with Public Input | Code | 1 |
| A Categorical Archive of ChatGPT Failures | Code | 1 |
| Injecting Numerical Reasoning Skills into Language Models | Code | 1 |
| Implicit Chain of Thought Reasoning via Knowledge Distillation | Code | 1 |
| How well do Large Language Models perform in Arithmetic tasks? | Code | 1 |
| Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Code | 1 |
| Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities | Code | 1 |
| Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles | Code | 1 |
| Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization | Code | 1 |
| How to Get Your LLM to Generate Challenging Problems for Evaluation | Code | 1 |
| Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction | Code | 1 |
| HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems | Code | 1 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Code | 1 |
| Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning | Code | 1 |
| Graph-to-Tree Learning for Solving Math Word Problems | Code | 1 |
| DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning | Code | 1 |
Page 16 of 64

No leaderboard results yet.