| Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | Jan 28, 2025 | MathMathematical Problem-Solving | —Unverified | 0 |
| Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | Jan 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning | Dec 7, 2023 | In-Context LearningMath | —Unverified | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | Oct 3, 2023 | BenchmarkingInstruction Following | —Unverified | 0 |
| Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models | Oct 2, 2024 | Cross-Lingual TransferMath | —Unverified | 0 |
| Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks | Mar 14, 2024 | MathSkill Generalization | —Unverified | 0 |
| BeamLoRA: Beam-Constraint Low-Rank Adaptation | Feb 19, 2025 | Code GenerationMath | —Unverified | 0 |
| Basic concepts, definitions, and methods in D number theory | Mar 21, 2020 | Math | —Unverified | 0 |
| Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization | Feb 18, 2025 | Math | —Unverified | 0 |
| Backward bifurcation and saddle-node bifurcation in virus-immune dynamics | Dec 1, 2021 | Math | —Unverified | 0 |
| Learning Autonomous Code Integration for Math Language Models | Feb 2, 2025 | Math | —Unverified | 0 |
| Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs | May 24, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval | Nov 25, 2024 | MathMath Word Problem Solving | —Unverified | 0 |
| Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models | Jul 12, 2024 | GSM8KMath | —Unverified | 0 |
| Learning Fine-Grained Expressions to Solve Math Word Problems | Sep 1, 2017 | MathMath Word Problem Solving | —Unverified | 0 |
| Learning from Peers in Reasoning Models | May 12, 2025 | Math | —Unverified | 0 |
| Activation Functions Considered Harmful: Recovering Neural Network Weights through Controlled Channels | Mar 24, 2025 | Math | —Unverified | 0 |
| What Makes a Good Dataset for Symbol Description Reading? | Apr 17, 2023 | document understandingMath | —Unverified | 0 |
| Learning Hierarchical Structures On-The-Fly with a Recurrent-Recursive Model for Sequences | Jul 1, 2018 | Language ModelingLanguage Modelling | —Unverified | 0 |
| 151 Estrategias de Trading (151 Trading Strategies) | Nov 14, 2019 | DescriptiveMath | —Unverified | 0 |
| Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy | Sep 26, 2024 | Knowledge TracingMath | —Unverified | 0 |
| Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision | May 21, 2025 | GSM8KLearning-To-Rank | —Unverified | 0 |
| Learning to Reason Across Parallel Samples for LLM Reasoning | Jun 10, 2025 | MathRe-Ranking | —Unverified | 0 |
| Backup Control Barrier Functions: Formulation and Comparative Study | Apr 22, 2021 | Math | —Unverified | 0 |
| WARM: A Weakly (+Semi) Supervised Model for Solving Math word Problems | Apr 14, 2021 | Math | —Unverified | 0 |
| Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator | Dec 12, 2024 | Math | —Unverified | 0 |
| Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | May 21, 2022 | Arithmetic ReasoningMath | —Unverified | 0 |
| Les mathématiques de la langue : l'approche formelle de Montague | May 16, 2014 | Math | —Unverified | 0 |
| Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning | Jun 25, 2023 | counterfactualMath | —Unverified | 0 |
| Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability | May 29, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| Let's Reinforce Step by Step | Nov 10, 2023 | GSM8KLogical Reasoning | —Unverified | 0 |
| Let's reward step by step: Step-Level reward model as the Navigators for Reasoning | Oct 16, 2023 | Code GenerationGSM8K | —Unverified | 0 |
| Auto-regressive Text Generation with Pre-Trained Language Models: An Empirical Study on Question-type Short Text Generation | Jan 16, 2022 | MathText Generation | —Unverified | 0 |
| Leveraging Affect Transfer Learning for Behavior Prediction in an Intelligent Tutoring System | Feb 12, 2020 | MathTransfer Learning | —Unverified | 0 |
| Leveraging LLMs to Assess Tutor Moves in Real-Life Dialogues: A Feasibility Study | Jun 20, 2025 | Math | —Unverified | 0 |
| Leveraging Multimodal Dialog Technology for the Design of Automated and Interactive Student Agents for Teacher Training | Jul 1, 2018 | Math | —Unverified | 0 |
| Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models | Sep 11, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications | Feb 14, 2024 | Math | —Unverified | 0 |
| LEWIS (LayEr WIse Sparsity) -- A Training Free Guided Model Merging Approach | Mar 5, 2025 | Instruction FollowingMath | —Unverified | 0 |
| Limits of an AI program for solving college math problems | Aug 14, 2022 | Few-Shot LearningMath | —Unverified | 0 |
| Automatized Evaluation of Formalization Exercises in Mathematics | Jun 2, 2020 | MathSentence | —Unverified | 0 |
| LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks | Dec 17, 2024 | Math | —Unverified | 0 |
| Automatic tagging of knowledge points for K12 math problems | Aug 21, 2022 | ClassificationMath | —Unverified | 0 |
| Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers | Jun 5, 2025 | GSM8KMath | —Unverified | 0 |
| LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ | Sep 25, 2024 | ChatbotGSM8K | —Unverified | 0 |
| Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology | Nov 5, 2024 | MathMisconceptions | —Unverified | 0 |
| Meaning-Typed Programming: Language Abstraction and Runtime for Model-Integrated Applications | May 14, 2024 | GSM8KMath | —Unverified | 0 |
| LLMs as Potential Brainstorming Partners for Math and Science Problems | Oct 10, 2023 | Math | —Unverified | 0 |
| Automatic Generation of High Quality CCGbanks for Parser Domain Adaptation | Jun 5, 2019 | Domain AdaptationMath | —Unverified | 0 |
| LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought | May 9, 2024 | HallucinationMath | —Unverified | 0 |