SOTAVerified

Mathematical Problem-Solving

Papers

Showing 101106 of 106 papers

TitleStatusHype
Evaluating Language Models for Mathematics through InteractionsCode1
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark DatasetsCode1
Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in TransformersCode1
Measuring Mathematical Problem Solving With the MATH DatasetCode2
Scaling Laws for Autoregressive Generative Modeling0
Automatic Detection of Reflective Thinking in Mathematical Problem Solving based on Unconstrained Bodily Exploration0
Show:102550
← PrevPage 3 of 3Next →

No leaderboard results yet.