SOTAVerified

Math

Papers

Showing 301-325 of 1596 papers

Title | Status | Hype
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind | Code | 1
MathPrompter: Mathematical Reasoning using Large Language Models | Code | 1
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning | Code | 1
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Code | 1
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving | Code | 1
Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation | Code | 1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Code | 1
From GAN to WGAN | Code | 1
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems | Code | 1
Expression Syntax Information Bottleneck for Math Word Problems | Code | 1
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems | Code | 1
Mathematical Capabilities of ChatGPT | Code | 1
A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers | Code | 1
MathGloss: Building mathematical glossaries from text | Code | 1
From Zero to Hero: Convincing with Extremely Complicated Math | Code | 1
FELM: Benchmarking Factuality Evaluation of Large Language Models | Code | 1
Math-KG: Construction and Applications of Mathematical Knowledge Graph | Code | 1
MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents | Code | 1
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective | Code | 1
MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education | Code | 1
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency | Code | 1
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Code | 1
AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models | Code | 1
Augmenting Math Word Problems via Iterative Question Composing | Code | 1
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency | Code | 1

No leaderboard results yet.