| The Perfect Blend: Redefining RLHF with Mixture of Judges | Sep 30, 2024 | Instruction FollowingMath | —Unverified | 0 |
| INC-Math: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models | Sep 28, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Revisiting the Superficial Alignment Hypothesis | Sep 27, 2024 | Instruction FollowingMath | —Unverified | 0 |
| On the Inductive Bias of Stacking Towards Improving Reasoning | Sep 27, 2024 | Inductive BiasMath | —Unverified | 0 |
| BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search | Sep 26, 2024 | MathMathematical Problem-Solving | CodeCode Available | 1 |
| Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy | Sep 26, 2024 | Knowledge TracingMath | —Unverified | 0 |
| Democratizing Signal Processing and Machine Learning: Math Learning Equity for Elementary and Middle School Students | Sep 25, 2024 | Math | —Unverified | 0 |
| PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning | Sep 25, 2024 | GSM8KMath | —Unverified | 0 |
| LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ | Sep 25, 2024 | ChatbotGSM8K | —Unverified | 0 |
| Models Can and Should Embrace the Communicative Nature of Human-Generated Math | Sep 25, 2024 | Math | —Unverified | 0 |