| PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts | Jun 7, 2023 | Cross-Lingual Paraphrase IdentificationMachine Translation | —Unverified | 0 | 0 |
| Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | Jan 5, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| PromptHive: Bringing Subject Matter Experts Back to the Forefront with Collaborative Prompt Engineering for Educational Content Creation | Oct 21, 2024 | MathPrompt Engineering | —Unverified | 0 | 0 |
| Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad | Mar 27, 2025 | MathMathematical Reasoning | —Unverified | 0 | 0 |
| PRSA: Prompt Stealing Attacks against Real-World Prompt Services | Feb 29, 2024 | Math | —Unverified | 0 | 0 |
| Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers | May 7, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning | Jun 20, 2024 | GSM8KHeuristic Search | —Unverified | 0 | 0 |
| QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning | Aug 20, 2024 | BenchmarkingLanguage Modelling | —Unverified | 0 | 0 |
| Quantitative Methods for Optimizing Patient Outcomes in Liver Transplantation | May 31, 2023 | ManagementMath | —Unverified | 0 | 0 |
| An Improved Coarse-to-Fine Method for Solving Generation Tasks | Apr 1, 2019 | MathMath Word Problem Solving | —Unverified | 0 | 0 |