SOTAVerified

Decision Making

Papers

Showing 201225 of 12311 papers

TitleStatusHype
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPRCode2
OMLT: Optimization & Machine Learning ToolkitCode2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesCode2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian SplattingCode2
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place RecognitionCode2
Context is Key: A Benchmark for Forecasting with Essential Textual InformationCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
ADAPT: Action-aware Driving Caption TransformerCode2
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery GamesCode2
Polis: Scaling Deliberation by Mapping High Dimensional Opinion SpacesCode2
Position: Foundation Agents as the Paradigm Shift for Decision MakingCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
Can Graph Learning Improve Planning in LLM-based Agents?Code2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
ProAgent: From Robotic Process Automation to Agentic Process AutomationCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
CausalPFN: Amortized Causal Effect Estimation via In-Context LearningCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Do As I Can, Not As I Say: Grounding Language in Robotic AffordancesCode2
MACRec: a Multi-Agent Collaboration Framework for RecommendationCode2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the WildCode2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision SupportCode2
Conformal Alignment: Knowing When to Trust Foundation Models with GuaranteesCode1
Conditioning Sparse Variational Gaussian Processes for Online Decision-makingCode1
Show:102550
← PrevPage 9 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified