SOTAVerified

Math

Papers

Showing 401425 of 1596 papers

TitleStatusHype
Large (Vision) Language Models are Unsupervised In-Context LearnersCode1
Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model ReasoningCode1
FinanceMath: Knowledge-Intensive Math Reasoning in Finance DomainsCode1
JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem UnderstandingCode1
Efficient Neural Theorem Proving via Fine-grained Proof Structure AnalysisCode1
Diversify and Conquer: Diversity-Centric Data Selection with Iterative RefinementCode1
Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation CapabilitiesCode1
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step ReasoningCode1
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom InstructionCode1
Efficient Process Reward Model Training via Active LearningCode1
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis ModelsCode1
Non-myopic Generation of Language Models for Reasoning and PlanningCode1
Implicit Chain of Thought Reasoning via Knowledge DistillationCode1
DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized DocumentsCode1
Improving the Validity of Automatically Generated Feedback via Reinforcement LearningCode1
How to Get Your LLM to Generate Challenging Problems for EvaluationCode1
How well do Large Language Models perform in Arithmetic tasks?Code1
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive PrinciplesCode1
HARP: A challenging human-annotated math reasoning benchmarkCode1
CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical ReasoningCode1
Efficient Reasoning for LLMs through Speculative Chain-of-ThoughtCode1
Injecting Numerical Reasoning Skills into Language ModelsCode1
Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word ProblemCode1
Graph-to-Tree Learning for Solving Math Word ProblemsCode1
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningCode1
Show:102550
← PrevPage 17 of 64Next →

No leaderboard results yet.