SOTAVerified

Decision Making

Papers

Showing 13261350 of 12311 papers

TitleStatusHype
Pedagogy-R1: Pedagogically-Aligned Reasoning Model with Balanced Educational Benchmark0
Cognitive Biases at Play? Insights from a Bayesian Game Framework0
Marginal Fairness: Fair Decision-Making under Risk Measures0
KL-regularization Itself is Differentially Private in Bandits and RLHF0
Development of Interactive Nomograms for Predicting Short-Term Survival in ICU Patients with Aplastic Anemia0
Towards Uncertainty Aware Task Delegation and Human-AI Collaborative Decision-Making0
Misaligning Reasoning with Answers -- A Framework for Assessing LLM CoT Robustness0
VIBE: Video-to-Text Information Bottleneck Evaluation for TL;DRCode0
An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems0
EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion0
Learning Representational Disparities0
Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making0
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach0
Multi-Objective Optimization Algorithms for Energy Management Systems in Microgrids: A Control Strategy Based on a PHIL System0
No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery0
Sequential Monte Carlo for Policy Optimization in Continuous POMDPs0
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving0
Semantic-Aware Interpretable Multimodal Music Auto-TaggingCode0
AppealCase: A Dataset and Benchmark for Civil Case Appeal ScenariosCode0
Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG)0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
Identification of Probabilities of Causation: A Complete Characterization0
ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding ValidationCode0
Gaussian Processes in Power Systems: Techniques, Applications, and Future Works0
Explaining Puzzle Solutions in Natural Language: An Exploratory Study on 6x6 Sudoku0
Show:102550
← PrevPage 54 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified