| Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus | Nov 19, 2024 | Formal LogicLogical Reasoning | CodeCode Available | 2 |
| Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues | Nov 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs | Nov 14, 2024 | General KnowledgeMath | CodeCode Available | 0 |
| RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing | Nov 13, 2024 | DecoderMath | CodeCode Available | 0 |
| Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations | Nov 12, 2024 | MathRetrieval | CodeCode Available | 1 |
| What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? | Nov 12, 2024 | GSM8KMath | CodeCode Available | 1 |
| UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts | Nov 11, 2024 | Code GenerationGSM8K | CodeCode Available | 1 |
| OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving? | Nov 9, 2024 | Logical ReasoningMath | —Unverified | 0 |
| VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM | Nov 8, 2024 | Math | —Unverified | 0 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |