SOTAVerified

Decision Making

Papers

Showing 17011725 of 12311 papers

TitleStatusHype
Wasserstein Distributionally Robust Bayesian Optimization with Continuous ContextCode0
LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused EvaluationCode0
Quantifying Symptom Causality in Clinical Decision Making: An Exploration Using CausaLM0
CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model0
High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting0
A theory of anticipated surprise for understanding risky intertemporal choices0
Multi-agent Application System in Office Collaboration Scenarios0
MARS: Memory-Enhanced Agents with Reflective Self-improvement0
Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes0
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners0
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning0
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages0
Statistical Proof of Execution (SPEX)0
Reinforcement Learning in Switching Non-Stationary Markov Decision Processes: Algorithms and Convergence Analysis0
EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments0
Generative AI in Knowledge Work: Design Implications for Data Navigation and Decision-Making0
Surgical Action Planning with Large Language Models0
Strategic Prompt Pricing for AIGC Services: A User-Centric Approach0
Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree0
Active Inference for Energy Control and Planning in Smart Buildings and Communities0
A Qualitative Study of User Perception of M365 AI Copilot0
A-IDE : Agent-Integrated Denoising Experts0
When Debate Fails: Bias Reinforcement in Large Language Models0
Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-CritiqueCode0
When Words Outperform Vision: VLMs Can Self-Improve Via Text-Only Training For Human-Centered Decision Making0
Show:102550
← PrevPage 69 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified