| NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks | Apr 12, 2022 | Arithmetic ReasoningMathematical Reasoning | —Unverified | 0 | 0 |
| Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs | Dec 29, 2023 | Mathematical Reasoning | —Unverified | 0 | 0 |
| One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs | Feb 12, 2025 | Mathematical Reasoning | —Unverified | 0 | 0 |
| On-Policy RL with Optimal Reward Baseline | May 29, 2025 | Large Language ModelMathematical Reasoning | —Unverified | 0 | 0 |
| On the meaning of uncertainty for ethical AI: philosophy and practice | Sep 11, 2023 | Decision MakingMathematical Reasoning | —Unverified | 0 | 0 |
| OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety | Mar 18, 2024 | BenchmarkingMathematical Reasoning | —Unverified | 0 | 0 |
| Optimizing Alignment with Less: Leveraging Data Augmentation for Personalized Evaluation | Dec 10, 2024 | Data AugmentationMathematical Reasoning | —Unverified | 0 | 0 |
| Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models | Jul 26, 2024 | Mathematical Reasoning | —Unverified | 0 | 0 |
| Orca 2: Teaching Small Language Models How to Reason | Nov 18, 2023 | Arithmetic ReasoningCommon Sense Reasoning | —Unverified | 0 | 0 |
| OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation | May 20, 2025 | Common Sense ReasoningMathematical Reasoning | —Unverified | 0 | 0 |