| Context is Key: A Benchmark for Forecasting with Essential Textual Information | Oct 24, 2024 | Decision MakingTime Series | CodeCode Available | 2 | 5 |
| Aligning Superhuman AI with Human Behavior: Chess as a Model System | Jun 2, 2020 | Decision Making | CodeCode Available | 2 | 5 |
| A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Dec 1, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 | 5 |
| Position: Foundation Agents as the Paradigm Shift for Decision Making | May 27, 2024 | Decision MakingPosition | CodeCode Available | 2 | 5 |
| Cross-Prediction-Powered Inference | Sep 28, 2023 | Decision MakingMissing Labels | CodeCode Available | 2 | 5 |
| ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Jul 4, 2024 | Chart UnderstandingDecision Making | CodeCode Available | 2 | 5 |
| PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Jan 6, 2025 | Decision Making | CodeCode Available | 2 | 5 |
| ProAgent: From Robotic Process Automation to Agentic Process Automation | Nov 2, 2023 | Decision Making | CodeCode Available | 2 | 5 |
| DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Nov 18, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 | 5 |
| V-Max: A Reinforcement Learning Framework for Autonomous Driving | Mar 11, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 | 5 |