| Big Math and the One-Brain Barrier A Position Paper and Architecture Proposal | Apr 23, 2019 | Math, Position | —Unverified | 0 |
| DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models | Oct 29, 2024 | Math, Mathematical Reasoning | —Unverified | 0 |
| Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces | Oct 13, 2024 | Computational Efficiency, Math | —Unverified | 0 |
| Accurate closed-form solution of the SIR epidemic model | Apr 16, 2020 | Form, Math | —Unverified | 0 |
| SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning | May 16, 2025 | Math | —Unverified | 0 |
| LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning | Dec 7, 2023 | In-Context Learning, Math | —Unverified | 0 |
| Biased Programmers? Or Biased Data? A Field Experiment in Operationalizing AI Ethics | Dec 4, 2020 | Ethics, Math | —Unverified | 0 |
| DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images | Jan 24, 2025 | Math | —Unverified | 0 |
| Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model | Jun 30, 2025 | Math | —Unverified | 0 |
| An Improved Coarse-to-Fine Method for Solving Generation Tasks | Apr 1, 2019 | Math, Math Word Problem Solving | —Unverified | 0 |
| A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications | Jan 9, 2025 | Math, RAG | —Unverified | 0 |
| Large Language Models Can Self-Correct with Key Condition Verification | May 23, 2024 | Arithmetic Reasoning, Math | —Unverified | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | Jan 31, 2024 | Diversity, Math | —Unverified | 0 |
| Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition | May 26, 2025 | Math, Reinforcement Learning (RL) | —Unverified | 0 |
| Dolphin: A Spoken Language Proficiency Assessment System for Elementary Education | Aug 1, 2019 | Math | —Unverified | 0 |
| Beyond Sentential Semantic Parsing: Tackling the Math SAT with a Cascade of Tree Transducers | Sep 1, 2017 | coreference-resolution, Coreference Resolution | —Unverified | 0 |
| Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology | Oct 19, 2024 | Logical Reasoning, Math | —Unverified | 0 |
| Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models | Dec 11, 2023 | Diversity, Math | —Unverified | 0 |
| Does Representation Intervention Really Identify Desired Concepts and Elicit Alignment? | May 24, 2025 | Code Generation, Math | —Unverified | 0 |
| Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? | Apr 18, 2025 | Math, Visual Reasoning | —Unverified | 0 |
| Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | Apr 28, 2025 | Data Augmentation, Diversity | —Unverified | 0 |
| Does Reasoning Introduce Bias? A Study of Social Bias Evaluation and Mitigation in LLM Reasoning | Feb 21, 2025 | Math | —Unverified | 0 |
| Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models | Aug 15, 2024 | Math | —Unverified | 0 |
| Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning | Oct 8, 2024 | Image Retrieval, Math | —Unverified | 0 |
| LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | Jun 10, 2025 | Large Language Model, Math | —Unverified | 0 |
| Large Language Models as Analogical Reasoners | Oct 3, 2023 | Code Generation, GSM8K | —Unverified | 0 |
| Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | Aug 16, 2024 | Descriptive, Hallucination | —Unverified | 0 |
| A Neural Network Implementation for Free Energy Principle | Jun 11, 2023 | Math | —Unverified | 0 |
| dMath: Distributed Linear Algebra for DL | Nov 19, 2016 | GPU, Management | —Unverified | 0 |
| dMath: A Scalable Linear Algebra and Math Library for Heterogeneous GP-GPU Architectures | Apr 5, 2016 | GPU, Management | —Unverified | 0 |
| Language Models with Conformal Factuality Guarantees | Feb 15, 2024 | Conformal Prediction, Language Modeling | —Unverified | 0 |
| Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation | May 30, 2024 | Code Generation, HumanEval | —Unverified | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | Mar 6, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| DiversiGATE: A Comprehensive Framework for Reliable Large Language Models | Jun 22, 2023 | Arithmetic Reasoning, GSM8K | —Unverified | 0 |
| Benchmarking Reasoning Robustness in Large Language Models | Mar 6, 2025 | Benchmarking, Math | —Unverified | 0 |
| Distributed Skellam Mechanism: a Novel Approach to Federated Learning with Differential Privacy | Sep 29, 2021 | Federated Learning, Math | —Unverified | 0 |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | Jun 29, 2024 | Binary Classification, GSM8K | —Unverified | 0 |
| DISK: Domain-constrained Instance Sketch for Math Word Problem Generation | Apr 10, 2022 | Math | —Unverified | 0 |
| DISC: Dynamic Decomposition Improves LLM Inference Scaling | Feb 23, 2025 | Computational Efficiency, Math | —Unverified | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | Oct 3, 2023 | Benchmarking, Instruction Following | —Unverified | 0 |
| Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks | Jun 16, 2025 | Form, Math | —Unverified | 0 |
| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | Diversity, GPU | —Unverified | 0 |
| An Efficient Merge Search Matheuristic for Maximising the Net Present Value of Project Schedules | Oct 20, 2022 | Math, Scheduling | —Unverified | 0 |
| DINGO: Constrained Inference for Diffusion LLMs | May 29, 2025 | Math | —Unverified | 0 |
| Dimension Reduction via Colour Refinement | Jul 22, 2013 | Dimensionality Reduction, Isomorphism Testing | —Unverified | 0 |
| BeamLoRA: Beam-Constraint Low-Rank Adaptation | Feb 19, 2025 | Code Generation, Math | —Unverified | 0 |
| Dimensionality reduction: theoretical perspective on practical measures | Dec 1, 2019 | BIG-bench Machine Learning, Dimensionality Reduction | —Unverified | 0 |
| Digenes: genetic algorithms to discover conjectures about directed and undirected graphs | Apr 30, 2013 | Math | —Unverified | 0 |
| Basic concepts, definitions, and methods in D number theory | Mar 21, 2020 | Math | —Unverified | 0 |
| Odd period cycles and ergodic properties in price dynamics for an exchange economy | Sep 17, 2023 | Math | —Unverified | 0 |