SOTAVerified

Decision Making

Papers

Showing 17011750 of 12311 papers

TitleStatusHype
Continual learning via probabilistic exchangeable sequence modelling0
Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes0
A theory of anticipated surprise for understanding risky intertemporal choices0
MARS: Memory-Enhanced Agents with Reflective Self-improvement0
Quantifying Symptom Causality in Clinical Decision Making: An Exploration Using CausaLM0
LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused EvaluationCode0
Multi-agent Application System in Office Collaboration Scenarios0
CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model0
High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting0
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages0
Statistical Proof of Execution (SPEX)0
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners0
EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments0
Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning0
Reinforcement Learning in Switching Non-Stationary Markov Decision Processes: Algorithms and Convergence Analysis0
Generative AI in Knowledge Work: Design Implications for Data Navigation and Decision-Making0
Surgical Action Planning with Large Language Models0
Strategic Prompt Pricing for AIGC Services: A User-Centric Approach0
Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree0
Active Inference for Energy Control and Planning in Smart Buildings and Communities0
A Qualitative Study of User Perception of M365 AI Copilot0
Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-CritiqueCode0
Sparse Additive Contextual Bandits: A Nonparametric Approach for Online Decision-making with High-dimensional Covariates0
When Words Outperform Vision: VLMs Can Self-Improve Via Text-Only Training For Human-Centered Decision Making0
A-IDE : Agent-Integrated Denoising Experts0
When Debate Fails: Bias Reinforcement in Large Language Models0
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis PredictionCode0
Information maximization for a broad variety of multi-armed bandit games0
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment0
Feature selection strategies for optimized heart disease diagnosis using ML and DL models0
Deferring Concept Bottleneck Models: Learning to Defer Interventions to Inaccurate Experts0
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey0
Limits of trust in medical AI0
Large Language Models for Water Distribution Systems Modeling and Decision-Making0
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse0
AI-Driven Sentiment Analytics: Unlocking Business Value in the E-Commerce Landscape_v10
Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education0
Depth Matters: Multimodal RGB-D Perception for Robust Autonomous AgentsCode0
Are AI Agents interacting with Online Ads?0
Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs0
Truthful Elicitation of Imprecise Forecasts0
AIJIM: A Scalable Model for Real-Time AI in Environmental Journalism0
Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems0
Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat0
Diffusion-Based Forecasting for Uncertainty-Aware Model Predictive Control0
Learning with Expert Abstractions for Efficient Multi-Task Continuous ControlCode0
Empowering Medical Multi-Agents with Clinical Consultation Flow for Dynamic Diagnosis0
VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making0
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems0
When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection0
Show:102550
← PrevPage 35 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified