| LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | Jun 10, 2025 | Large Language ModelMath | —Unverified | 0 | 0 |
| CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks | Sep 13, 2024 | ARCCode Generation | —Unverified | 0 | 0 |
| Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia | Sep 13, 2024 | MathMultiple-choice | —Unverified | 0 | 0 |
| Cramer-Rao bound and absolute sensitivity in chemical reaction networks | Jan 13, 2024 | MathSensitivity | —Unverified | 0 | 0 |
| CRANE: Reasoning with constrained LLM generation | Feb 13, 2025 | Code GenerationMath | —Unverified | 0 | 0 |
| Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs | Mar 18, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference | Feb 8, 2021 | MathQuantization | —Unverified | 0 | 0 |
| Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 | 0 |
| Critique Ability of Large Language Models | Oct 7, 2023 | Code CompletionDecision Making | —Unverified | 0 | 0 |
| Characterizing Student Engagement Moods for Dropout Prediction in Question Pool Websites | Jan 31, 2021 | Hybrid Machine LearningMath | —Unverified | 0 | 0 |