SOTAVerified

Math

Papers

Showing 976-1000 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision |  | 0 |
| Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation |  | 0 |
| MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Code | 1 |
| Salsa Fresca: Angular Embeddings and Pre-Training for ML Attacks on Learning With Errors |  | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges |  | 0 |
| Efficient Tool Use with Chain-of-Abstraction Reasoning |  | 0 |
| Taxonomy of Mathematical Plagiarism | Code | 0 |
| ReGAL: Refactoring Programs to Discover Generalizable Abstractions | Code | 1 |
| GAPS: Geometry-Aware Problem Solver |  | 0 |
| YODA: Teacher-Student Progressive Learning for Language Models |  | 0 |
| Exploring Educational Equity: A Machine Learning Approach to Unravel Achievement Disparities in Georgia |  | 0 |
| Can AI Assistants Know What They Don't Know? | Code | 2 |
| TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks | Code | 1 |
| Using Java Geometry Expert as Guide in the Preparations for Math Contests |  | 0 |
| SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese | Code | 2 |
| Over-Reasoning and Redundant Calculation of Large Language Models | Code | 1 |
| Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning | Code | 1 |
| Augmenting Math Word Problems via Iterative Question Composing | Code | 1 |
| Large Language Models Are Neurosymbolic Reasoners | Code | 1 |
| ReFT: Reasoning with Reinforced Fine-Tuning | Code | 4 |
| Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions | Code | 1 |
| Tuning Language Models by Proxy | Code | 2 |
| Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination |  | 0 |
| MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline | Code | 3 |
| SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models | Code | 2 |

No leaderboard results yet.