SOTAVerified

Math

Papers

Showing 1076–1100 of 1596 papers

Title | Status | Hype
Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators | | 0
Scaling Test-Time Compute Without Verification or RL is Suboptimal | | 0
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training | | 0
Accelerating Neural Network Optimization Through an Automated Control Theory Lens | | 0
Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text | | 0
A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems | | 0
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization | | 0
Using Intermediate Representations to Solve Math Word Problems | | 0
A Large Scale Quantitative Exploration of Modeling Strategies for Content Scoring | | 0
Using Java Geometry Expert as Guide in the Preparations for Math Contests | | 0
Self-Competitive Learning for Solving Math Word Problem | | 0
Self-Consistency Boosts Calibration for Math Reasoning | | 0
Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors | | 0
Self-Consistency Preference Optimization | | 0
Self-consistent Reasoning For Solving Math Word Problems | | 0
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models | | 0
A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving | | 0
Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | | 0
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts | | 0
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models | | 0
Self-reinforced polynomial approximation methods for concentrated probability densities | | 0
Self-Supervised Pretraining of Graph Neural Network for the Retrieval of Related Mathematical Expressions in Scientific Articles | | 0
Using Large Language Model to Solve and Explain Physics Word Problems Approaching Human Level | | 0
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs | | 0
Utility-Driven Speculative Decoding for Mixture-of-Experts | | 0

No leaderboard results yet.