| Theorem Prover as a Judge for Synthetic Data Generation | Feb 18, 2025 | Mathematical Proofs, Mathematical Reasoning | —Unverified | 0 | 0 |
| Theoretical Analysis of an XGBoost Framework for Product Cannibalization | Dec 2, 2021 | Mathematical Reasoning | —Unverified | 0 | 0 |
| The Qiyas Benchmark: Measuring ChatGPT Mathematical and Language Understanding in Arabic | Jun 28, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| The Role of General Intelligence in Mathematical Reasoning | Apr 27, 2021 | Mathematical Reasoning | —Unverified | 0 | 0 |
| The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs | May 23, 2025 | Cross-Lingual Transfer, Math | —Unverified | 0 | 0 |
| Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Feb 27, 2025 | Mamba, Mathematical Reasoning | —Unverified | 0 | 0 |
| Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains | May 22, 2025 | Mathematical Reasoning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| TinyGSM: achieving >80% on GSM8k with small language models | Dec 14, 2023 | Arithmetic Reasoning, GSM8K | —Unverified | 0 | 0 |
| Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning | Feb 5, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Token-Level Uncertainty Estimation for Large Language Model Reasoning | May 16, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |