SOTAVerified

Math

Papers

Showing 13761400 of 1596 papers

TitleStatusHype
The Invalsi Benchmarks: measuring Linguistic and Mathematical understanding of Large Language Models in Italian0
Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models0
First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning0
Fixation probabilities for the Moran process in evolutionary games with two strategies: graph shapes and large population asymptotics0
Fixation probabilities for the Moran process with three or more strategies: general and coupling results0
Building Math Agents with Multi-Turn Iterative Preference Learning0
Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration0
The Logic of Political Survival Revisited: Consequences of Elite Uncertainty Under Authoritarian Rule0
Formal Mathematical Reasoning: A New Frontier in AI0
The Long-Term Effects of Teachers' Gender Stereotypes0
fPLSA: Learning Semantic Structures in Document Collections Using Foundation Models0
FRACTAL: Fine-Grained Scoring from Aggregate Text Labels0
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning0
From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems0
From fixation probabilities to d-player games: an inverse problem in evolutionary dynamics0
The Mathematics of Market Timing0
From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting0
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision0
From Textbooks to Knowledge: A Case Study in Harvesting Axiomatic Knowledge from Textbooks to Solve Geometry Problems0
From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics0
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens0
Bridging Offline and Online Reinforcement Learning for LLMs0
Breaking Ties: Regression Discontinuity Design Meets Market Design0
Gamifying Math Education using Object Detection0
GAPS: Geometry-Aware Problem Solver0
Show:102550
← PrevPage 56 of 64Next →

No leaderboard results yet.