| Strictly monotone mean-variance preferences with applications to portfolio selection | Dec 18, 2024 | ManagementMath | —Unverified | 0 |
| Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models | Dec 18, 2024 | HumanEvalImitation Learning | —Unverified | 0 |
| LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks | Dec 17, 2024 | Math | —Unverified | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | DescriptiveMath | CodeCode Available | 0 |
| A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support | Dec 16, 2024 | Large Language ModelMath | CodeCode Available | 0 |
| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | DiversityGPU | —Unverified | 0 |
| Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning | Dec 12, 2024 | Geometry Problem SolvingIn-Context Learning | —Unverified | 0 |
| Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator | Dec 12, 2024 | Math | —Unverified | 0 |
| A Context-Enhanced Framework for Sequential Graph Reasoning | Dec 12, 2024 | Math | CodeCode Available | 0 |
| A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions | Dec 12, 2024 | GSM8KKnowledge Graphs | —Unverified | 0 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Dec 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MNIST-Fraction: Enhancing Math Education with AI-Driven Fraction Detection and Analysis | Dec 11, 2024 | Math | —Unverified | 0 |
| LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation | Dec 10, 2024 | Math | CodeCode Available | 0 |
| When Dimensionality Reduction Meets Graph (Drawing) Theory: Introducing a Common Framework, Challenges and Opportunities | Dec 9, 2024 | Dimensionality ReductionMath | —Unverified | 0 |
| Mining Math Conjectures from LLMs: A Pruning Approach | Dec 9, 2024 | Math | —Unverified | 0 |
| Chimera: Improving Generalist Model with Domain-Specific Experts | Dec 8, 2024 | Mathmodel | —Unverified | 0 |
| Neuro-Symbolic Data Generation for Math Reasoning | Dec 6, 2024 | DiversityMath | —Unverified | 0 |
| Hard Math -- Easy UVM: Pragmatic solutions for verifying hardware algorithms using UVM | Dec 6, 2024 | Math | —Unverified | 0 |
| Automated LaTeX Code Generation from Handwritten Math Expressions Using Vision Transformer | Dec 5, 2024 | Code GenerationDecoder | —Unverified | 0 |
| Enhancing Mathematical Reasoning in LLMs with Background Operators | Dec 5, 2024 | Data AugmentationMath | —Unverified | 0 |
| RedStone: Curating General, Code, Math, and QA Data for Large Language Models | Dec 4, 2024 | Domain AdaptationMath | —Unverified | 0 |
| Unsupervised learning-based calibration scheme for Rough Bergomi model | Dec 3, 2024 | Math | CodeCode Available | 0 |
| MALT: Improving Reasoning with Multi-Agent LLM Training | Dec 2, 2024 | Common Sense ReasoningGSM8K | —Unverified | 0 |
| Yi-Lightning Technical Report | Dec 2, 2024 | ChatbotLarge Language Model | —Unverified | 0 |
| Reverse Thinking Makes LLMs Stronger Reasoners | Nov 29, 2024 | Data AugmentationKnowledge Distillation | —Unverified | 0 |
| Mars-PO: Multi-Agent Reasoning System Preference Optimization | Nov 28, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems | Nov 28, 2024 | LEMMAMath | —Unverified | 0 |
| Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS | Nov 27, 2024 | In-Context LearningMath | CodeCode Available | 0 |
| Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval | Nov 25, 2024 | MathMath Word Problem Solving | —Unverified | 0 |
| Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures | Nov 25, 2024 | GSM8KMath | —Unverified | 0 |
| Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training | Nov 21, 2024 | Math | —Unverified | 0 |
| MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs | Nov 14, 2024 | General KnowledgeMath | CodeCode Available | 0 |
| RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing | Nov 13, 2024 | DecoderMath | CodeCode Available | 0 |
| OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving? | Nov 9, 2024 | Logical ReasoningMath | —Unverified | 0 |
| VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM | Nov 8, 2024 | Math | —Unverified | 0 |
| Meta-Reasoning Improves Tool Use in Large Language Models | Nov 7, 2024 | Math | CodeCode Available | 0 |
| Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams | Nov 7, 2024 | Math | —Unverified | 0 |
| Self-Consistency Preference Optimization | Nov 6, 2024 | GSM8KMath | —Unverified | 0 |
| Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology | Nov 5, 2024 | MathMisconceptions | —Unverified | 0 |
| Leveraging Label Semantics and Meta-Label Refinement for Multi-Label Question Classification | Nov 4, 2024 | MathReranking | CodeCode Available | 0 |
| Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models | Nov 2, 2024 | GSM8KMath | —Unverified | 0 |
| STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing | Nov 1, 2024 | 2kIn-Context Learning | —Unverified | 0 |
| DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models | Oct 29, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses | Oct 29, 2024 | MathZero-Shot Learning | —Unverified | 0 |
| Improving Math Problem Solving in Large Language Models Through Categorization and Strategy Tailoring | Oct 29, 2024 | Math | —Unverified | 0 |
| EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation | Oct 28, 2024 | ARCMath | —Unverified | 0 |
| Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks? | Oct 27, 2024 | Data AugmentationMath | CodeCode Available | 0 |
| Library Learning Doesn't: The Curious Case of the Single-Use "Library" | Oct 26, 2024 | MathMathematical Reasoning | CodeCode Available | 0 |