SOTAVerified

Math

Papers

Showing 13511375 of 1596 papers

TitleStatusHype
The Hallucination Tax of Reinforcement Finetuning0
Explaining Math Word Problem Solvers0
Explain with Visual Keypoints Like a Real Mentor! A Benchmark for Multimodal Solution Explanation0
Explanation Generation for a Math Word Problem Solver0
Explicit Knowledge Transfer for Weakly-Supervised Code Generation0
Exploring Educational Equity: A Machine Learning Approach to Unravel Achievement Disparities in Georgia0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate0
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them0
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases0
Calculus on MDPs: Potential Shaping as a Gradient0
Exploring the Mystery of Influential Data for Mathematical Reasoning0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning0
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity0
Extracting the Unknown from Long Math Problems0
Fairness Hub Technical Briefs: AUC Gap0
Fairshare Data Pricing via Data Valuation for Large Language Models0
FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean40
BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems0
Fast Diffusion Inhibits Disease Outbreaks0
Faster and Better LLMs via Latency-Aware Test-Time Scaling0
Feature Selection Based on Confidence Machine0
The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory0
Few-Shot Recalibration of Language Models0
FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning0
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models0
Show:102550
← PrevPage 55 of 64Next →

No leaderboard results yet.