SOTAVerified

Decision Making

Papers

Showing 201225 of 12311 papers

TitleStatusHype
OMLT: Optimization & Machine Learning ToolkitCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
Cumulative Reasoning with Large Language ModelsCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place RecognitionCode2
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM AgentsCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPRCode2
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery GamesCode2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the WildCode2
Aligning Superhuman AI with Human Behavior: Chess as a Model SystemCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction TuningCode2
A Survey of Financial AI: Architectures, Advances and Open ChallengesCode2
Preserving Causal Constraints in Counterfactual Explanations for Machine Learning ClassifiersCode2
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward ModelsCode2
ProAgent: From Robotic Process Automation to Agentic Process AutomationCode2
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsCode2
Context is Key: A Benchmark for Forecasting with Essential Textual InformationCode2
Cross-Prediction-Powered InferenceCode2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous DrivingCode2
Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive DecodingCode2
Conformal Alignment: Knowing When to Trust Foundation Models with GuaranteesCode1
Conditioning Sparse Variational Gaussian Processes for Online Decision-makingCode1
Show:102550
← PrevPage 9 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified