| StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error | Mar 13, 2025 | Math | CodeCode Available | 0 |
| Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression | Mar 13, 2025 | Code GenerationConformal Prediction | —Unverified | 0 |
| Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning | Mar 13, 2025 | In-Context LearningMath | —Unverified | 0 |
| Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data | Mar 13, 2025 | Large Language ModelMath | —Unverified | 0 |
| Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning | Mar 10, 2025 | MathMeta Reinforcement Learning | —Unverified | 0 |
| From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics | Mar 10, 2025 | MathQuestion Answering | —Unverified | 0 |
| Decoding the Black Box: Integrating Moral Imagination with Technical AI Governance | Mar 9, 2025 | EthicsMath | —Unverified | 0 |
| InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models | Mar 9, 2025 | Computational EfficiencyMath | —Unverified | 0 |
| Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Mar 7, 2025 | GPUMath | —Unverified | 0 |
| SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning | Mar 6, 2025 | GSM8KMath | —Unverified | 0 |