| Task Generalization With AutoRegressive Compositional Structure: Can Learning From Tasks Generalize to ^T Tasks? | Feb 13, 2025 | ARCIn-Context Learning | —Unverified | 0 |
| Vision-Language In-Context Learning Driven Few-Shot Visual Inspection Model | Feb 13, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| AuPair: Golden Example Pairs for Code Repair | Feb 12, 2025 | Code RepairIn-Context Learning | —Unverified | 0 |
| In-Context Learning of Linear Dynamical Systems with Transformers: Error Bounds and Depth-Separation | Feb 12, 2025 | In-Context Learning | —Unverified | 0 |
| Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction | Feb 12, 2025 | Grammatical Error CorrectionIn-Context Learning | CodeCode Available | 0 |
| The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models | Feb 11, 2025 | DecoderIn-Context Learning | —Unverified | 0 |
| Hallucination, Monofacts, and Miscalibration: An Empirical Investigation | Feb 11, 2025 | DecoderHallucination | CodeCode Available | 0 |
| Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning | Feb 11, 2025 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models | Feb 10, 2025 | In-Context Learning | —Unverified | 0 |
| MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations | Feb 10, 2025 | BenchmarkingIn-Context Learning | —Unverified | 0 |