| LLMs can be easily Confused by Instructional Distractions | Feb 5, 2025 | Bias DetectionCode Generation | —Unverified | 0 |
| DavIR: Data Selection via Implicit Reward for Large Language Models | Oct 16, 2023 | Causal Language ModelingGSM8K | —Unverified | 0 |
| Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | Dec 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | May 26, 2025 | ARCLogical Reasoning | —Unverified | 0 |
| Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search | Jan 2, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | Jun 6, 2024 | Arithmetic ReasoningCode Generation | —Unverified | 0 |
| Enhancing Neural Mathematical Reasoning by Abductive Combination with Symbolic Library | Mar 28, 2022 | Logical ReasoningMathematical Reasoning | —Unverified | 0 |
| Enhancing Mathematical Reasoning in LLMs with Background Operators | Dec 5, 2024 | Data AugmentationMath | —Unverified | 0 |
| Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? | May 15, 2025 | Mathematical Reasoning | —Unverified | 0 |
| Enhancing Mathematical Reasoning in LLMs by Stepwise Correction | Oct 16, 2024 | Mathematical Reasoning | —Unverified | 0 |