| Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic | Feb 19, 2024 | Instruction Following, Math | Code Available | 2 |
| Reformatted Alignment | Feb 19, 2024 | GSM8K, Hallucination | Code Available | 2 |
| LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | Feb 18, 2024 | Math | Unverified | 0 |
| Orca-Math: Unlocking the potential of SLMs in Grade School Math | Feb 16, 2024 | Arithmetic Reasoning, GSM8K | Unverified | 0 |
| Language Models as Science Tutors | Feb 16, 2024 | GSM8K, Math | Code Available | 1 |
| Language Models with Conformal Factuality Guarantees | Feb 15, 2024 | Conformal Prediction, Language Modeling | Unverified | 0 |
| Mathematical Opportunities in Digital Twins (MATH-DT) | Feb 15, 2024 | Math | Unverified | 0 |
| OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | Feb 15, 2024 | Arithmetic Reasoning, GSM8K | Code Available | 4 |
| GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving | Feb 15, 2024 | Geometry Problem Solving, Math | Code Available | 1 |
| AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails | Feb 14, 2024 | Language Modeling, Language Modelling | Code Available | 0 |