| A quantitative study of NLP approaches to question difficulty estimation | May 17, 2023 | Math, Multiple-choice | Code Available | 0 |
| A Probabilistic Model for Node Classification in Directed Graphs | Jan 3, 2025 | Math, Node Classification | Code Available | 0 |
| Inference-time Alignment in Continuous Space | May 26, 2025 | Math | Code Available | 0 |
| WARM: A Weakly (+Semi) Supervised Math Word Problem Solver | Oct 1, 2022 | Math | Code Available | 0 |
| Unlocking Temporal Question Answering for Large Language Models with Tailor-Made Reasoning Logic | May 24, 2023 | Logical Reasoning, Math | Code Available | 0 |
| RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing | Nov 13, 2024 | Decoder, Math | Code Available | 0 |
| Can LLMs Solve longer Math Word Problems Better? | May 23, 2024 | Data Augmentation, Math | Code Available | 0 |
| Tracing and Manipulating Intermediate Values in Neural Math Problem Solvers | Jan 17, 2023 | Math | Code Available | 0 |
| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Sep 21, 2024 | Math, Text to SQL | Code Available | 0 |
| Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning | Feb 11, 2025 | Code Generation, Math | Code Available | 0 |
| MathScale: Scaling Instruction Tuning for Mathematical Reasoning | Mar 5, 2024 | GSM8K, Math | Code Available | 0 |
| MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark | Aug 14, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance | Oct 3, 2023 | Code Generation, Logical Reasoning | Code Available | 0 |
| Instructing Large Language Models to Identify and Ignore Irrelevant Conditions | Mar 19, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Rethinking floating point for deep learning | Nov 1, 2018 | Deep Learning, Math | Code Available | 0 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8k, Language Modeling | Code Available | 0 |
| Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation | Apr 20, 2020 | Deep Learning, Math | Code Available | 0 |
| Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning | Jan 24, 2018 | CPU, Deep Learning | Code Available | 0 |
| Public Attitudes Toward ChatGPT on Twitter: Sentiments, Topics, and Occupations | Jun 22, 2023 | Chatbot, Language Modelling | Code Available | 0 |
| Coarse-grained Stochastic Model of Myosin-Driven Vesicles into Dendritic Spines | Jul 15, 2021 | Math | Code Available | 0 |
| Solving Arithmetic Word Problems Automatically Using Transformer and Unambiguous Representations | Dec 2, 2019 | Math | Code Available | 0 |
| Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS | Nov 27, 2024 | In-Context Learning, Math | Code Available | 0 |
| Can LLMs Reason in the Wild with Programs? | Jun 19, 2024 | GSM8K, Math | Code Available | 0 |
| Unbiased Math Word Problems Benchmark for Mitigating Solving Bias | May 17, 2022 | Math | Code Available | 0 |