| Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification | Oct 5, 2024 | GSM8KMath | —Unverified | 0 |
| BloomWise: Enhancing Problem-Solving capabilities of Large Language Models using Bloom's-Taxonomy-Inspired Prompts | Oct 5, 2024 | Math | —Unverified | 0 |
| Deliberate Reasoning for LLMs as Structure-aware Planning with Accurate World Model | Oct 4, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models | Oct 3, 2024 | AllLanguage Modeling | —Unverified | 0 |
| CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning | Oct 3, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Towards the Pedagogical Steering of Large Language Models for Tutoring: A Case Study with Modeling Productive Failure | Oct 3, 2024 | Math | CodeCode Available | 0 |
| Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection | Oct 3, 2024 | Mathparameter-efficient fine-tuning | CodeCode Available | 0 |
| Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation | Oct 3, 2024 | GSM8KMath | —Unverified | 0 |
| An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings | Oct 2, 2024 | 8kMath | CodeCode Available | 0 |
| Evaluating Robustness of Reward Models for Mathematical Reasoning | Oct 2, 2024 | MathMathematical Reasoning | —Unverified | 0 |