| Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators | Mar 25, 2025 | Math | —Unverified | 0 | 0 |
| Scaling Test-Time Compute Without Verification or RL is Suboptimal | Feb 17, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training | Jul 16, 2025 | Code GenerationMath | —Unverified | 0 | 0 |
| Accelerating Neural Network Optimization Through an Automated Control Theory Lens | Jan 1, 2022 | Math | —Unverified | 0 | 0 |
| Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text | Jul 2, 2021 | MathText Generation | —Unverified | 0 | 0 |
| A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems | Nov 28, 2024 | LEMMAMath | —Unverified | 0 | 0 |
| SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization | May 18, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| Using Intermediate Representations to Solve Math Word Problems | Jul 1, 2018 | MathMath Word Problem Solving | —Unverified | 0 | 0 |
| A Large Scale Quantitative Exploration of Modeling Strategies for Content Scoring | Sep 1, 2017 | Math | —Unverified | 0 | 0 |
| Using Java Geometry Expert as Guide in the Preparations for Math Contests | Jan 22, 2024 | Math | —Unverified | 0 | 0 |
| Self-Competitive Learning for Solving Math Word Problem | Nov 16, 2021 | Math | —Unverified | 0 | 0 |
| Self-Consistency Boosts Calibration for Math Reasoning | Mar 14, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors | Jan 6, 2024 | Math | —Unverified | 0 | 0 |
| Self-Consistency Preference Optimization | Nov 6, 2024 | GSM8KMath | —Unverified | 0 | 0 |
| Self-consistent Reasoning For Solving Math Word Problems | Oct 27, 2022 | Math | —Unverified | 0 | 0 |
| Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models | Mar 4, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving | Nov 1, 2020 | Common Sense ReasoningDecoder | —Unverified | 0 | 0 |
| Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | Jan 16, 2024 | GSM8KLanguage Modeling | —Unverified | 0 | 0 |
| Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts | Jun 17, 2024 | Math | —Unverified | 0 | 0 |
| Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models | May 1, 2024 | Math | —Unverified | 0 | 0 |
| Self-reinforced polynomial approximation methods for concentrated probability densities | Mar 5, 2023 | Math | —Unverified | 0 | 0 |
| Self-Supervised Pretraining of Graph Neural Network for the Retrieval of Related Mathematical Expressions in Scientific Articles | Aug 22, 2022 | ArticlesGraph Neural Network | —Unverified | 0 | 0 |
| Using Large Language Model to Solve and Explain Physics Word Problems Approaching Human Level | Sep 15, 2023 | Few-Shot LearningHigh School Physics | —Unverified | 0 | 0 |
| Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs | May 19, 2023 | Arithmetic ReasoningGSM8K | —Unverified | 0 | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | Jun 17, 2025 | GPULarge Language Model | —Unverified | 0 | 0 |