SOTAVerified

Decision Making

Papers

Showing 1671-1680 of 12311 papers

Title | Status | Hype
Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany | | 0
Detecting Malicious AI Agents Through Simulated Interactions | | 0
Exploring Explainable Multi-player MCTS-minimax Hybrids in Board Game Using Process Mining | | 0
What Makes an Evaluation Useful? Common Pitfalls and Best Practices | | 0
Towards Trustworthy GUI Agents: A Survey | Code | 0
Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach | Code | 0
A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks | | 0
Iterative VCG-based Mechanism Fosters Cooperation in Multi-Regional Network Design | | 0
Towards Interpretable Counterfactual Generation via Multimodal Autoregression | | 0
When Autonomy Breaks: The Hidden Existential Risk of AI | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified