| Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation | Dec 20, 2024 | MathMathematical Reasoning | CodeCode Available | 0 |
| Offline Reinforcement Learning for LLM Multi-Step Reasoning | Dec 20, 2024 | GSM8KMath | CodeCode Available | 2 |
| Formal Mathematical Reasoning: A New Frontier in AI | Dec 20, 2024 | Automated Theorem ProvingMath | —Unverified | 0 |
| Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning | Dec 19, 2024 | Math | —Unverified | 0 |
| Qwen2.5 Technical Report | Dec 19, 2024 | Common Sense Reasoning | CodeCode Available | 13 |
| Conceptual In-Context Learning and Chain of Concepts: Solving Complex Conceptual Problems Using Large Language Models | Dec 19, 2024 | In-Context LearningMath | —Unverified | 0 |
| AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling | Dec 19, 2024 | Math | —Unverified | 0 |
| Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Dec 19, 2024 | MathMathematical Reasoning | CodeCode Available | 0 |
| Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models | Dec 18, 2024 | HumanEvalImitation Learning | —Unverified | 0 |
| Strictly monotone mean-variance preferences with applications to portfolio selection | Dec 18, 2024 | ManagementMath | —Unverified | 0 |
| LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks | Dec 17, 2024 | Math | —Unverified | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | DescriptiveMath | CodeCode Available | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support | Dec 16, 2024 | Large Language ModelMath | CodeCode Available | 0 |
| Entropy-Regularized Process Reward Model | Dec 15, 2024 | GSM8KMath | CodeCode Available | 1 |
| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | DiversityGPU | —Unverified | 0 |
| Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning | Dec 12, 2024 | Geometry Problem SolvingIn-Context Learning | —Unverified | 0 |
| A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions | Dec 12, 2024 | GSM8KKnowledge Graphs | —Unverified | 0 |
| Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator | Dec 12, 2024 | Math | —Unverified | 0 |
| A Context-Enhanced Framework for Sequential Graph Reasoning | Dec 12, 2024 | Math | CodeCode Available | 0 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Dec 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| HARP: A challenging human-annotated math reasoning benchmark | Dec 11, 2024 | Math | CodeCode Available | 1 |
| MNIST-Fraction: Enhancing Math Education with AI-Driven Fraction Detection and Analysis | Dec 11, 2024 | Math | —Unverified | 0 |
| LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation | Dec 10, 2024 | Math | CodeCode Available | 0 |
| Mining Math Conjectures from LLMs: A Pruning Approach | Dec 9, 2024 | Math | —Unverified | 0 |