| Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation | May 29, 2025 | GSM8KMath | —Unverified | 0 |
| A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions | Dec 12, 2024 | GSM8KKnowledge Graphs | —Unverified | 0 |
| HyperCLOVA X Technical Report | Apr 2, 2024 | Instruction FollowingMachine Translation | —Unverified | 0 |
| Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics | Apr 24, 2025 | Code GenerationMath | —Unverified | 0 |
| Human Learning about AI | Jun 8, 2024 | Math | —Unverified | 0 |
| Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams | Nov 7, 2024 | Math | —Unverified | 0 |
| A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science | Mar 21, 2024 | Active LearningMath | —Unverified | 0 |
| Hydrodynamics of Markets:Hidden Links Between Physics and Finance | Mar 14, 2024 | Math | —Unverified | 0 |
| Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models | Feb 17, 2025 | Math | —Unverified | 0 |
| Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations | Jun 27, 2019 | Math | —Unverified | 0 |
| Can I understand what I create? Self-Knowledge Evaluation of Large Language Models | Jun 10, 2024 | Math | —Unverified | 0 |
| Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate | May 22, 2023 | BenchmarkingMath | —Unverified | 0 |
| A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio | Sep 10, 2024 | Emotional IntelligenceMath | —Unverified | 0 |
| Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework | Jan 26, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning | May 22, 2025 | Mathreinforcement-learning | —Unverified | 0 |
| How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation | Aug 1, 2016 | Community Question AnsweringMath | —Unverified | 0 |
| EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation | Oct 28, 2024 | ARCMath | —Unverified | 0 |
| Approximation properties of Residual Neural Networks for Kolmogorov PDEs | Oct 30, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Entropy Martingale Optimal Transport and Nonlinear Pricing-Hedging Duality | May 26, 2020 | Math | —Unverified | 0 |
| Calculus on MDPs: Potential Shaping as a Gradient | Aug 20, 2022 | Math | —Unverified | 0 |
| Approximating Sparse PCA from Incomplete Data | Mar 12, 2015 | Math | —Unverified | 0 |
| Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation | Apr 16, 2025 | GSM8KMath | —Unverified | 0 |
| BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems | Mar 18, 2025 | CPUMath | —Unverified | 0 |
| Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference | Feb 5, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | Aug 29, 2024 | Code GenerationDiversity | —Unverified | 0 |