| MathScale: Scaling Instruction Tuning for Mathematical Reasoning | Mar 5, 2024 | GSM8KMath | CodeCode Available | 0 |
| Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning | Mar 4, 2024 | GSM8KMath | —Unverified | 0 |
| The Claude 3 Model Family: Opus, Sonnet, Haiku | Mar 4, 2024 | 1 Image, 2*2 StitchingArithmetic Reasoning | —Unverified | 0 |
| Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training | Mar 4, 2024 | MathPhrase Grounding | —Unverified | 0 |
| Experimenting with Generative AI: Does ChatGPT Really Increase Everyone's Productivity? | Mar 4, 2024 | EconometricsMath | —Unverified | 0 |
| ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data | Mar 1, 2024 | Math | —Unverified | 0 |
| PRSA: Prompt Stealing Attacks against Real-World Prompt Services | Feb 29, 2024 | Math | —Unverified | 0 |
| Data Interpreter: An LLM Agent For Data Science | Feb 28, 2024 | Code GenerationLanguage Modelling | —Unverified | 0 |
| Adversarial Math Word Problem Generation | Feb 27, 2024 | Math | CodeCode Available | 0 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8kLanguage Modeling | CodeCode Available | 0 |
| MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs | Feb 26, 2024 | GSM8KMath | —Unverified | 0 |
| How Do Humans Write Code? Large Models Do It the Same Way Too | Feb 24, 2024 | Code GenerationMath | CodeCode Available | 0 |
| Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes | Feb 23, 2024 | MathMathematical Reasoning | CodeCode Available | 0 |
| MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models | Feb 20, 2024 | Common Sense ReasoningContrastive Learning | —Unverified | 0 |
| LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | Feb 18, 2024 | Math | —Unverified | 0 |
| Orca-Math: Unlocking the potential of SLMs in Grade School Math | Feb 16, 2024 | Arithmetic ReasoningGSM8K | —Unverified | 0 |
| Mathematical Opportunities in Digital Twins (MATH-DT) | Feb 15, 2024 | Math | —Unverified | 0 |
| Language Models with Conformal Factuality Guarantees | Feb 15, 2024 | Conformal PredictionLanguage Modeling | —Unverified | 0 |
| AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails | Feb 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications | Feb 14, 2024 | Math | —Unverified | 0 |
| GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements | Feb 13, 2024 | GSM8KMath | —Unverified | 0 |
| EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages | Feb 12, 2024 | Automated Theorem ProvingBenchmarking | —Unverified | 0 |
| Understanding the Progression of Educational Topics via Semantic Matching | Feb 10, 2024 | Math | —Unverified | 0 |
| V-STaR: Training Verifiers for Self-Taught Reasoners | Feb 9, 2024 | Code GenerationMath | —Unverified | 0 |
| In-Context Principle Learning from Mistakes | Feb 8, 2024 | GSM8KIn-Context Learning | CodeCode Available | 0 |