| FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning | Oct 8, 2024 | GSM8KHallucination | —Unverified | 0 |
| Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering | Apr 27, 2023 | Mathematical Reasoning | —Unverified | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models for Design Structure Matrix Optimization | Jun 11, 2025 | Combinatorial OptimizationMathematical Reasoning | —Unverified | 0 |
| Large Language Models Have Intrinsic Meta-Cognition, but Need a Good Lens | Jun 10, 2025 | BenchmarkingMathematical Reasoning | —Unverified | 0 |
| CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning | Jan 21, 2025 | ClusteringMathematical Reasoning | —Unverified | 0 |
| Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning | Jun 16, 2024 | BenchmarkingMath | —Unverified | 0 |
| ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment | May 25, 2025 | Code GenerationMathematical Reasoning | —Unverified | 0 |
| Exploring the Mystery of Influential Data for Mathematical Reasoning | Apr 1, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions | Apr 29, 2024 | Mathematical Reasoning | —Unverified | 0 |