| Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | Aug 29, 2024 | Code GenerationDiversity | —Unverified | 0 |
| Building Math Agents with Multi-Turn Iterative Preference Learning | Sep 4, 2024 | GSM8KMath | —Unverified | 0 |
| A Bayesian model for recognizing handwritten mathematical expressions | Sep 18, 2014 | Mathmodel | —Unverified | 0 |
| InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion | Jan 6, 2025 | GSM8KHumanEval | —Unverified | 0 |
| Integer Networks for Data Compression with Latent-Variable Models | May 1, 2019 | Data CompressionMath | —Unverified | 0 |
| Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | Dec 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | May 26, 2025 | ARCLogical Reasoning | —Unverified | 0 |
| A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition | Aug 13, 2024 | Common Sense ReasoningMath | —Unverified | 0 |
| APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries | Apr 27, 2025 | Automated Theorem ProvingBug fixing | —Unverified | 0 |
| AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling | Dec 19, 2024 | Math | —Unverified | 0 |
| Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search | Jun 10, 2025 | GSM8KMath | —Unverified | 0 |
| GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach | Aug 18, 2023 | Math | —Unverified | 0 |
| BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Jan 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Math Learning in an LMS Using AI-Driven Question Recommendations | Apr 18, 2025 | ManagementMath | —Unverified | 0 |
| Enhancing Mathematical Reasoning in LLMs with Background Operators | Dec 5, 2024 | Data AugmentationMath | —Unverified | 0 |
| Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | Oct 18, 2024 | MathQuestion Answering | —Unverified | 0 |
| Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation | Nov 7, 2023 | MathRAG | —Unverified | 0 |
| Bridging Offline and Online Reinforcement Learning for LLMs | Jun 26, 2025 | Instruction FollowingMath | —Unverified | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | Feb 17, 2025 | BenchmarkingCode Summarization | —Unverified | 0 |
| End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics | Nov 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | Jan 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Breaking Ties: Regression Discontinuity Design Meets Market Design | Dec 31, 2020 | Mathregression | —Unverified | 0 |
| 構建一個中文國小數學文字問題語料庫(Building a Corpus for Developing the Chinese Elementary School Math Word Problem Solver)[In Chinese] | Oct 1, 2016 | Math | —Unverified | 0 |
| AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy | Jun 16, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| In between myth and reality: AI for math -- a case study in category theory | Apr 17, 2025 | Math | —Unverified | 0 |