SOTAVerified

Math

Papers

Showing 1151-1175 of 1596 papers

| Title | Status | Hype |
|---|---|---|
| RevOrder: A Novel Method for Enhanced Arithmetic in Language Models | | 0 |
| Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision | | 0 |
| Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation | | 0 |
| Salsa Fresca: Angular Embeddings and Pre-Training for ML Attacks on Learning With Errors | | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | | 0 |
| Efficient Tool Use with Chain-of-Abstraction Reasoning | | 0 |
| Taxonomy of Mathematical Plagiarism | Code | 0 |
| GAPS: Geometry-Aware Problem Solver | | 0 |
| YODA: Teacher-Student Progressive Learning for Language Models | | 0 |
| Exploring Educational Equity: A Machine Learning Approach to Unravel Achievement Disparities in Georgia | | 0 |
| Using Java Geometry Expert as Guide in the Preparations for Math Contests | | 0 |
| Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | | 0 |
| CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities | | 0 |
| Cramer-Rao bound and absolute sensitivity in chemical reaction networks | | 0 |
| Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors | | 0 |
| Graph2Tac: Online Representation Learning of Formal Math Concepts | | 0 |
| Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction | | 0 |
| Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities | | 0 |
| From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting | | 0 |
| TinyGSM: achieving >80% on GSM8k with small language models | | 0 |
| Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning | | 0 |
| Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models | | 0 |
| LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning | | 0 |
| ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions | Code | 0 |
| REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints | | 0 |

No leaderboard results yet.