SOTAVerified

Math

Papers

Showing 476-500 of 1596 papers

Title | Status | Hype
From Zero to Hero: Convincing with Extremely Complicated Math | Code | 1
Building Dataset for Grounding of Formulae — Annotating Coreference Relations Among Math Identifiers | Code | 1
A Relation Spectrum Inheriting Taylor Series: Muscle Synergy and Coupling for Hand | Code | 1
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving | Code | 1
NeMo-Inspector: A Visualization Tool for LLM Generation Analysis | Code | 1
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models | Code | 1
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks | Code | 1
NLPBench: Evaluating Large Language Models on Solving NLP Problems | Code | 1
Entropy-Regularized Process Reward Model | Code | 1
ArMATH: a Dataset for Solving Arabic Math Word Problems | Code | 1
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Code | 1
FELM: Benchmarking Factuality Evaluation of Large Language Models | Code | 1
Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation | Code | 1
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models | Code | 1
Pretrained Language Models are Symbolic Mathematics Solvers too! | Code | 1
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation | Code | 1
Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations | Code | 1
Expression Syntax Information Bottleneck for Math Word Problems | Code | 1
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning | Code | 1
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts | Code | 1
EXAONE Deep: Reasoning Enhanced Language Models | Code | 1
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems | Code | 1
Explaining Datasets in Words: Statistical Models with Natural Language Parameters | Code | 1
Are NLP Models really able to Solve Simple Math Word Problems? | Code | 1
Case-Based or Rule-Based: How Do Transformers Do the Math? | Code | 1
Page 20 of 64

No leaderboard results yet.