| Smiles in delta | Sep 1, 2022 | Math | —Unverified | 0 | 0 |
| SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model | Feb 4, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 | 0 |
| Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training | Nov 21, 2024 | Math | —Unverified | 0 | 0 |
| SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning | Mar 6, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| Solving Arithmetic Word Problems Using Transformer and Pre-processing of Problem Texts | Dec 1, 2020 | Math | —Unverified | 0 | 0 |
| Solving Arithmetic Word Problems with Transformers and Preprocessing of Problem Text | Jun 2, 2021 | Math | —Unverified | 0 | 0 |
| Solving Linear Algebra by Program Synthesis | Nov 16, 2021 | MathProgram Synthesis | —Unverified | 0 | 0 |
| Solving Linear Algebra by Program Synthesis | Nov 16, 2021 | MathProgram Synthesis | —Unverified | 0 | 0 |
| Heterogeneous Line Graph Transformer for Math Word Problems | Aug 11, 2022 | MathRepresentation Learning | —Unverified | 0 | 0 |
| Veracity Bias and Beyond: Uncovering LLMs' Hidden Beliefs in Problem-Solving Reasoning | May 22, 2025 | AttributeMath | —Unverified | 0 | 0 |
| Solving Math Word Problems with Double-Decoder Transformer | Aug 28, 2019 | DecoderMath | —Unverified | 0 | 0 |
| Solving math word problems with process- and outcome-based feedback | Nov 25, 2022 | Arithmetic ReasoningGSM8K | —Unverified | 0 | 0 |
| VGR: Visual Grounded Reasoning | Jun 13, 2025 | Large Language ModelMath | —Unverified | 0 | 0 |
| SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms | Jun 6, 2025 | DiversityLarge Language Model | —Unverified | 0 | 0 |
| Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling | Oct 15, 2024 | Instruction FollowingKnowledge Distillation | —Unverified | 0 | 0 |
| SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? | Mar 16, 2025 | Board GamesCard Games | —Unverified | 0 | 0 |
| SplitReason: Learning To Offload Reasoning | Apr 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model | Jul 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| SSR: Speculative Parallel Scaling Reasoning in Test-time | May 21, 2025 | DiversityMath | —Unverified | 0 | 0 |
| Stable Code Technical Report | Apr 1, 2024 | Code CompletionLanguage Modelling | —Unverified | 0 | 0 |
| AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models | May 25, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| START: Self-taught Reasoner with Tools | Mar 6, 2025 | MathSelf-Learning | —Unverified | 0 | 0 |
| A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions | Dec 12, 2024 | GSM8KKnowledge Graphs | —Unverified | 0 | 0 |
| Steering LLM Reasoning Through Bias-Only Adaptation | May 24, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |