SOTAVerified

Decision Making

Papers

Showing 576600 of 12311 papers

TitleStatusHype
Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding0
Accelerating drug discovery with Artificial: a whole-lab orchestration and scheduling system for self-driving labs0
Quantum Random Lunch Generator (QRLG)Code0
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDBCode3
Role and Use of Race in AI/ML Models Related to Health0
Enhancing Time Series Forecasting with Fuzzy Attention-Integrated TransformersCode0
Detecting Malicious AI Agents Through Simulated Interactions0
New Statistical Framework for Extreme Error Probability in High-Stakes Domains for Reliable Machine Learning0
Frequency-Aware Attention-LSTM for PM_2.5 Time Series Forecasting0
The more the merrier: logical and multistage processors in credit scoringCode0
When Counterfactual Reasoning Fails: Chaos and Real-World Complexity0
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective0
Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany0
LLM4FS: Leveraging Large Language Models for Feature Selection and How to Improve It0
Language Guided Concept Bottleneck Models for Interpretable Continual LearningCode1
What Makes an Evaluation Useful? Common Pitfalls and Best Practices0
OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action ModelCode4
Exploring Explainable Multi-player MCTS-minimax Hybrids in Board Game Using Process Mining0
Towards Trustworthy GUI Agents: A SurveyCode0
Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game ApproachCode0
Iterative VCG-based Mechanism Fosters Cooperation in Multi-Regional Network Design0
Towards Interpretable Counterfactual Generation via Multimodal Autoregression0
A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks0
Towards Personalized Conversational Sales Agents : Contextual User Profiling for Strategic Action0
GroundHog: Revolutionizing GLDAS Groundwater Storage Downscaling for Enhanced Recharge Estimation in Bangladesh0
Show:102550
← PrevPage 24 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified