| PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts | Jun 7, 2023 | Cross-Lingual Paraphrase IdentificationMachine Translation | —Unverified | 0 |
| World Models for Math Story Problems | Jun 7, 2023 | Math | CodeCode Available | 0 |
| Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction | Jun 5, 2023 | Math | CodeCode Available | 1 |
| Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning | Jun 4, 2023 | Math | CodeCode Available | 1 |
| Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems? | Jun 3, 2023 | MathMath Word Problem Solving | CodeCode Available | 0 |
| MathChat: Converse to Tackle Challenging Math Problems with LLM Agents | Jun 2, 2023 | Elementary MathematicsMath | CodeCode Available | 1 |
| Learning Multi-Step Reasoning by Solving Arithmetic Tasks | Jun 2, 2023 | MathMathematical Reasoning | CodeCode Available | 1 |
| AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Jun 1, 2023 | Autonomous DrivingCloud Computing | CodeCode Available | 6 |
| Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions | Jun 1, 2023 | Math | —Unverified | 0 |