| Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models | Jul 12, 2024 | GSM8K, Math | Unverified | 0 |
| TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models | Jul 12, 2024 | Code Generation, Math | Unverified | 0 |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Jul 12, 2024 | Language Modeling | Code Available | 0 |
| Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On | Jul 11, 2024 | GSM8K, Math | Unverified | 0 |
| Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Jul 11, 2024 | GSM8K, Math | Unverified | 0 |
| ConvNLP: Image-based AI Text Detection | Jul 9, 2024 | Domain Generalization, Math | Unverified | 0 |
| Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models | Jul 9, 2024 | Math | Code Available | 0 |
| Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns? | Jul 6, 2024 | Math | Code Available | 0 |
| Smart Vision-Language Reasoners | Jul 5, 2024 | Math, Mathematical Reasoning | Code Available | 0 |
| Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Jul 2, 2024 | Language Modeling | Code Available | 0 |