SOTAVerified

Math

Papers

Showing 551575 of 1596 papers

TitleStatusHype
Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic CorpusCode2
Unlocking State-Tracking in Linear RNNs Through Negative EigenvaluesCode1
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMsCode0
RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic ProcessingCode0
Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring ConversationsCode1
What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?Code1
UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding ThoughtsCode1
OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving?0
VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM0
Aioli: A Unified Optimization Framework for Language Model Data MixingCode1
Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams0
Meta-Reasoning Improves Tool Use in Large Language ModelsCode0
Self-Consistency Preference Optimization0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology0
Leveraging Label Semantics and Meta-Label Refinement for Multi-Label Question ClassificationCode0
Regress, Don't Guess -- A Regression-like Loss on Number Tokens for Language ModelsCode1
Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models0
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing0
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models0
Improving Math Problem Solving in Large Language Models Through Categorization and Strategy Tailoring0
Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses0
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic ConsistencyCode1
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of HeuristicsCode1
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation0
Flaming-hot Initiation with Regular Execution Sampling for Large Language ModelsCode2
Show:102550
← PrevPage 23 of 64Next →

No leaderboard results yet.