| Leveraging Constrained Monte Carlo Tree Search to Generate Reliable Long Chain-of-Thought for Mathematical Reasoning | Feb 16, 2025 | Mathematical Reasoning | —Unverified | 0 |
| LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning | Apr 3, 2025 | Mathematical ReasoningQuestion Answering | —Unverified | 0 |
| LiteSearch: Efficacious Tree Search for LLM | Jun 29, 2024 | GSM8KMathematical Reasoning | —Unverified | 0 |
| LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ | Sep 25, 2024 | ChatbotGSM8K | —Unverified | 0 |
| LLM4DV: Using Large Language Models for Hardware Test Stimuli Generation | Oct 6, 2023 | BenchmarkingMathematical Reasoning | —Unverified | 0 |
| LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems | Apr 3, 2025 | Mathematical Reasoning | —Unverified | 0 |
| LLM Library Learning Fails: A LEGO-Prover Case Study | Apr 3, 2025 | Mathematical ReasoningMisconceptions | —Unverified | 0 |
| LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning | Dec 28, 2024 | Mathematical Reasoning | —Unverified | 0 |
| LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement | Jun 29, 2024 | Contrastive LearningMathematical Reasoning | —Unverified | 0 |
| LLMs can be easily Confused by Instructional Distractions | Feb 5, 2025 | Bias DetectionCode Generation | —Unverified | 0 |
| LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought | May 9, 2024 | HallucinationMath | —Unverified | 0 |
| LLMs can implicitly learn from mistakes in-context | Feb 12, 2025 | Mathematical Reasoning | —Unverified | 0 |
| DavIR: Data Selection via Implicit Reward for Large Language Models | Oct 16, 2023 | Causal Language ModelingGSM8K | —Unverified | 0 |
| Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models | Feb 24, 2024 | GSM8KMathematical Reasoning | —Unverified | 0 |
| Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts | Jun 24, 2024 | Mathematical ReasoningVisual Question Answering (VQA) | —Unverified | 0 |
| Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models | Dec 13, 2024 | Mathematical Reasoning | —Unverified | 0 |
| LPML: LLM-Prompting Markup Language for Mathematical Reasoning | Sep 21, 2023 | Mathematical Reasoning | —Unverified | 0 |
| Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection | Nov 13, 2024 | Code GenerationMathematical Reasoning | —Unverified | 0 |
| Machine learning and information theory concepts towards an AI Mathematician | Mar 7, 2024 | Mathematical Reasoning | —Unverified | 0 |
| MAPS: A Multilingual Benchmark for Global Agent Performance and Security | May 21, 2025 | Code GenerationMath | —Unverified | 0 |
| Markov Chain of Thought for Efficient Mathematical Reasoning | Oct 23, 2024 | Mathematical Reasoning | —Unverified | 0 |
| Mars-PO: Multi-Agent Reasoning System Preference Optimization | Nov 28, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality | Jun 17, 2025 | Code GenerationMathematical Reasoning | —Unverified | 0 |
| MATATA: Weakly Supervised End-to-End MAthematical Tool-Augmented Reasoning for Tabular Applications | Nov 28, 2024 | document understandingMathematical Reasoning | —Unverified | 0 |