| Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models | Feb 27, 2024 | Dark Humor Detection, Dialogue Generation | —Unverified | 0 |
| MMLU-SR: A Benchmark for Stress-Testing Reasoning Capability of Large Language Models | Jun 15, 2024 | Mathematical Reasoning, MMLU | —Unverified | 0 |
| Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units | Apr 7, 2021 | Mathematical Reasoning | —Unverified | 0 |
| Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection | Mar 21, 2024 | Mathematical Reasoning | —Unverified | 0 |
| Reliable and Efficient Amortized Model-based Evaluation | Mar 17, 2025 | Diagnostic, Mathematical Reasoning | —Unverified | 0 |
| Reliable Natural Language Understanding with Large Language Models and Answer Set Programming | Feb 7, 2023 | Mathematical Reasoning, Natural Language Understanding | —Unverified | 0 |
| Reliable Reasoning Beyond Natural Language | Jul 16, 2024 | GSM8K, Mathematical Reasoning | —Unverified | 0 |
| ReTool: Reinforcement Learning for Strategic Tool Use in LLMs | Apr 15, 2025 | Math, Mathematical Reasoning | —Unverified | 0 |
| Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning | Feb 20, 2025 | Mathematical Reasoning, Retrieval | —Unverified | 0 |
| Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot | Jun 17, 2025 | In-Context Learning, Mathematical Reasoning | —Unverified | 0 |
| Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness | May 29, 2025 | Diversity, Large Language Model | —Unverified | 0 |
| Revisiting Overthinking in Long Chain-of-Thought from the Perspective of Self-Doubt | May 29, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation | Feb 27, 2025 | Diversity, Mathematical Reasoning | —Unverified | 0 |
| Revisiting Test-Time Scaling: A Survey and a Diversity-Aware Method for Efficient Reasoning | Jun 5, 2025 | Diversity, Mathematical Reasoning | —Unverified | 0 |
| Revisiting the Superficial Alignment Hypothesis | Sep 27, 2024 | Instruction Following, Math | —Unverified | 0 |
| RL-finetuning LLMs from on- and off-policy data with a single algorithm | Mar 25, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions | Jun 7, 2024 | Hallucination, Mathematical Reasoning | —Unverified | 0 |
| RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library | Apr 29, 2025 | Data Augmentation, Mathematical Reasoning | —Unverified | 0 |
| S^3c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners | Sep 3, 2024 | GSM8K, Math | —Unverified | 0 |
| SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models | Apr 5, 2024 | Mathematical Reasoning | —Unverified | 0 |
| Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking | Dec 12, 2024 | Mathematical Reasoning | —Unverified | 0 |
| Sample, Don't Search: Rethinking Test-Time Alignment for Language Models | Apr 4, 2025 | GSM8K, Mathematical Reasoning | —Unverified | 0 |
| Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search | Feb 4, 2025 | Mathematical Reasoning | —Unverified | 0 |
| SAT Solvers and Computer Algebra Systems: A Powerful Combination for Mathematics | Jul 9, 2019 | Mathematical Proofs, Mathematical Reasoning | —Unverified | 0 |
| SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization | May 18, 2025 | Math, Mathematical Reasoning | —Unverified | 0 |