SOTAVerified

Decision Making

Papers

Showing 98269850 of 12311 papers

TitleStatusHype
Calibrating Where It Matters: Constrained Temperature Scaling0
Boldness-Recalibration for Binary Event Predictions0
Online Calibrated and Conformal Prediction Improves Bayesian Optimization0
Calibration of Human Driving Behavior and Preference Using Naturalistic Traffic Data0
CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems0
Can AI Extract Antecedent Factors of Human Trust in AI? An Application of Information Extraction for Scientific Literature in Behavioural and Computer Sciences0
Can an AI agent hit a moving target?0
Can an AI Agent Safely Run a Government? Existence of Probably Approximately Aligned Policies0
Can Attention-based Transformers Explain or Interpret Cyberbullying Detection?0
Can A User Anticipate What Her Followers Want?0
Can Automatic Metrics Assess High-Quality Translations?0
Can Bio-Inspired Swarm Algorithms Scale to Modern Societal Problems0
Cancer-Answer: Empowering Cancer Care with Advanced Large Language Models0
Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models0
Can Domain Experts Rely on AI Appropriately? A Case Study on AI-Assisted Prostate Cancer MRI Diagnosis0
Can Education Motivate Individual Health Demands? Dynamic Pseudo-panel Evidence from China's Immigration0
Can Interpretable Reinforcement Learning Manage Prosperity Your Way?0
Can Language Models Serve as Text-Based World Simulators?0
Can Language Representation Models Think in Bets?0
Can Large Language Models Beat Wall Street? Unveiling the Potential of AI in Stock Selection0
Can large language models explore in-context?0
Can Large Language Models Play Games? A Case Study of A Self-Play Approach0
Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling?0
Can LLMs be Good Financial Advisors?: An Initial Study in Personal Decision Making for Optimized Outcomes0
Can LLMs Grade Short-Answer Reading Comprehension Questions : An Empirical Study with a Novel Dataset0
Show:102550
← PrevPage 394 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified