| Improving Physics Reasoning in Large Language Models Using Mixture of Refinement Agents | Dec 1, 2024 | Mathematical ReasoningMMLU | —Unverified | 0 |
| Simple and Provable Scaling Laws for the Test-Time Compute of Large Language Models | Nov 29, 2024 | MMLU | —Unverified | 0 |
| Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference | Nov 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Predicting Emergent Capabilities by Finetuning | Nov 25, 2024 | CoLAGSM8K | —Unverified | 0 |
| Learning from "Silly" Questions Improves Large Language Models, But Only Slightly | Nov 21, 2024 | EconometricsGlobal Facts | —Unverified | 0 |
| GenBFA: An Evolutionary Optimization Approach to Bit-Flip Attacks on LLMs | Nov 21, 2024 | MMLUText Generation | —Unverified | 0 |
| Real-time Adapting Routing (RAR): Improving Efficiency Through Continuous Learning in Software Powered by Layered Foundation Models | Nov 14, 2024 | Domain GeneralizationIn-Context Learning | —Unverified | 0 |
| Reasoning Robustness of LLMs to Adversarial Typographical Errors | Nov 8, 2024 | GSM8KMMLU | —Unverified | 0 |
| Watson: A Cognitive Observability Framework for the Reasoning of LLM-Powered Agents | Nov 5, 2024 | MMLU | —Unverified | 0 |
| TODO: Enhancing LLM Alignment with Ternary Preferences | Nov 2, 2024 | ARCMMLU | CodeCode Available | 0 |