| Solving Functional Optimization with Deep Networks and Variational Principles | Oct 8, 2024 | Math | —Unverified | 0 |
| Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs | Jan 21, 2025 | GSM8KIn-Context Learning | —Unverified | 0 |
| Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition | May 26, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Dolphin: A Spoken Language Proficiency Assessment System for Elementary Education | Aug 1, 2019 | Math | —Unverified | 0 |
| Beyond Sentential Semantic Parsing: Tackling the Math SAT with a Cascade of Tree Transducers | Sep 1, 2017 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology | Oct 19, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models | Dec 11, 2023 | DiversityMath | —Unverified | 0 |
| Does Representation Intervention Really Identify Desired Concepts and Elicit Alignment? | May 24, 2025 | Code GenerationMath | —Unverified | 0 |
| Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? | Apr 18, 2025 | MathVisual Reasoning | —Unverified | 0 |
| Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | Apr 28, 2025 | Data AugmentationDiversity | —Unverified | 0 |