| Query Auto Completion for Math Formula Search | Dec 9, 2019 | Math | —Unverified | 0 |
| QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation | Jul 17, 2025 | Math, Reinforcement Learning (RL) | —Unverified | 0 |
| Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning | May 20, 2025 | Math, Offline RL | —Unverified | 0 |
| Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement | Sep 18, 2024 | GSM8K, Math | —Unverified | 0 |
| A Neural Network Implementation for Free Energy Principle | Jun 11, 2023 | Math | —Unverified | 0 |
| An Efficient Merge Search Matheuristic for Maximising the Net Present Value of Project Schedules | Oct 20, 2022 | Math, Scheduling | —Unverified | 0 |
| Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding | Feb 17, 2025 | Arithmetic Reasoning, Chart Understanding | —Unverified | 0 |
| RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs | May 22, 2025 | Image Manipulation, Math | —Unverified | 0 |
| ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning | Oct 24, 2024 | GSM8K, Math | —Unverified | 0 |
| Odd period cycles and ergodic properties in price dynamics for an exchange economy | Sep 17, 2023 | Math | —Unverified | 0 |
| ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs | Jun 23, 2025 | Math | —Unverified | 0 |
| Reasoning about Quantities in Natural Language | Jan 1, 2015 | Math, Natural Language Inference | —Unverified | 0 |
| Reasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language Models Through Logic Unit Alignment | Feb 5, 2025 | GSM8K, HumanEval | —Unverified | 0 |
| Reasoning Like Program Executors | Nov 16, 2021 | Logical Reasoning, Math | —Unverified | 0 |
| Reasoning Like Program Executors | Jan 27, 2022 | Logical Reasoning, Math | —Unverified | 0 |
| Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification | Apr 7, 2025 | Logical Reasoning, Math | —Unverified | 0 |
| Unit Dependency Graph and its Application to Arithmetic Word Problem Solving | Dec 3, 2016 | Math, Natural Language Understanding | —Unverified | 0 |
| Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths | Oct 7, 2024 | Attribute, GSM8K | —Unverified | 0 |
| Anchored Diffusion Language Model | May 24, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| An Augmented Benchmark Dataset for Geometric Question Answering through Dual Parallel Text Encoding | Oct 1, 2022 | Data Augmentation, Math | —Unverified | 0 |
| Reasoning with Large Language Models, a Survey | Jul 16, 2024 | Few-Shot Learning, In-Context Learning | —Unverified | 0 |
| Reasoning with Latent Thoughts: On the Power of Looped Transformers | Feb 24, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| RSRM: Reinforcement Symbolic Regression Machine | May 24, 2023 | Math, Q-Learning | —Unverified | 0 |
| AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling | Dec 19, 2024 | Math | —Unverified | 0 |
| Analyzing Non-Textual Content Elements to Detect Academic Plagiarism | Jun 10, 2021 | Math, text similarity | —Unverified | 0 |
| Rectified Sparse Attention | Jun 4, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models | Jan 3, 2025 | GSM8K, Math | —Unverified | 0 |
| Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Jul 25, 2024 | Imitation Learning, Language Modeling | —Unverified | 0 |
| REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints | Nov 22, 2023 | Computational Efficiency, Math | —Unverified | 0 |
| RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? | Jan 20, 2025 | Math, Reinforcement Learning (RL) | —Unverified | 0 |
| RedStone: Curating General, Code, Math, and QA Data for Large Language Models | Dec 4, 2024 | Domain Adaptation, Math | —Unverified | 0 |
| Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning | May 30, 2025 | Math, reinforcement-learning | —Unverified | 0 |
| Analytic solution of the SEIR epidemic model via asymptotic approximant | Jun 30, 2020 | Form, Math | —Unverified | 0 |
| Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models | Oct 6, 2023 | 8k, Math | —Unverified | 0 |
| Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem | Jun 3, 2025 | GPU, Math | —Unverified | 0 |
| Reinforced optimal control | Nov 24, 2020 | Math, regression | —Unverified | 0 |
| Reinforce LLM Reasoning through Multi-Agent Reflection | Jun 10, 2025 | Math, Out-of-Distribution Generalization | —Unverified | 0 |
| Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task | Apr 11, 2023 | Deep Reinforcement Learning, Explainable artificial intelligence | —Unverified | 0 |
| A multi-core periphery perspective: Ranking via relative centrality | Jun 6, 2024 | Math | —Unverified | 0 |
| Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models | May 15, 2025 | Code Generation, GSM8K | —Unverified | 0 |
| Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval | Mar 22, 2023 | Adversarial Robustness, Deep Hashing | —Unverified | 0 |
| ReMI: A Dataset for Reasoning with Multiple Images | Jun 13, 2024 | Chart Understanding, Math | —Unverified | 0 |
| WordSup: Exploiting Word Annotations for Character based Text Detection | Aug 22, 2017 | Math, Scene Text Detection | —Unverified | 0 |
| Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models | Oct 7, 2023 | Math | —Unverified | 0 |
| Rethink Delay Doppler Channels and Time-Frequency Coding | Dec 31, 2024 | Math | —Unverified | 0 |
| Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? | Feb 2, 2025 | Math, MMLU | —Unverified | 0 |
| Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading | Apr 16, 2025 | 2k, Code Generation | —Unverified | 0 |
| Accurate closed-form solution of the SIR epidemic model | Apr 16, 2020 | Form, Math | —Unverified | 0 |
| ReTool: Reinforcement Learning for Strategic Tool Use in LLMs | Apr 15, 2025 | Math, Mathematical Reasoning | —Unverified | 0 |
| Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures | Nov 25, 2024 | GSM8K, Math | —Unverified | 0 |