SOTAVerified

Decision Making

Papers

Showing 35013550 of 12311 papers

TitleStatusHype
On the Performance of Empirical Risk Minimization with Smoothed Data0
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay0
We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity0
Reframing the Expected Free Energy: Four Formulations and a Unification0
Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems0
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models0
Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method0
Self-Supervised Interpretable End-to-End Learning via Latent Functional Modularity0
Testing autonomous vehicles and AI: perspectives and challenges from cybersecurity, transparency, robustness and fairness0
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action ChainCode2
Beyond A*: Better Planning with Transformers via Search Dynamics BootstrappingCode3
A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making0
Analyizing the Conjunction Fallacy as a Fact0
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating0
SaGE: Evaluating Moral Consistency in Large Language ModelsCode0
Efficient Normalized Conformal Prediction and Uncertainty Quantification for Anti-Cancer Drug Sensitivity Prediction with Deep Regression Forests0
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Voice-Driven Mortality Prediction in Hospitalized Heart Failure Patients: A Machine Learning Approach Enhanced with Diagnostic Biomarkers0
Social Environment DesignCode0
Generative Probabilistic Time Series Forecasting and Applications in Grid Operations0
Hybrid Reasoning Based on Large Language Models for Autonomous Car DrivingCode0
PRECISE Framework: GPT-based Text For Improved Readability, Reliability, and Understandability of Radiology Reports For Patient-Centered Care0
OpenHEXAI: An Open-Source Framework for Human-Centered Evaluation of Explainable Machine Learning0
Referee-Meta-Learning for Fast Adaptation of Locational Fairness0
Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic SpacesCode0
What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based AgentsCode0
Testing Calibration in Nearly-Linear TimeCode0
Reflect-RL: Two-Player Online RL Fine-Tuning for LMsCode1
Are LLMs Rational Investors? A Study on Detecting and Reducing the Financial Bias in LLMs0
XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning TechniquesCode1
Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention StrategiesCode0
Align Your Intents: Offline Imitation Learning via Optimal Transport0
Random Graph Set and Evidence Pattern Reasoning Model0
Multimodal Fusion of EHR in Structures and Semantics: Integrating Clinical Records and Notes with Hypergraph and LLM0
Applying News and Media Sentiment Analysis for Generating Forex Trading Signals0
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?Code0
Towards AI-Based Precision Oncology: A Machine Learning Framework for Personalized Counterfactual Treatment Suggestions based on Multi-Omics Data0
Synthetic location trajectory generation using categorical diffusion modelsCode0
UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal PredictionCode3
Statistical Test on Diffusion Model-based Anomaly Detection by Selective Inference0
Multi-View Conformal Learning for Heterogeneous Sensor FusionCode0
Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic0
All Language Models Large and Small0
Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations0
MM-SurvNet: Deep Learning-Based Survival Risk Stratification in Breast Cancer Through Multimodal Data Fusion0
Stackelberg reinsurance and premium decisions with MV criterion and irreversibility0
Pattern-wise Transparent Sequential Recommendation0
EndoOOD: Uncertainty-aware Out-of-distribution Detection in Capsule Endoscopy Diagnosis0
Self-evolving Autoencoder Embedded Q-Network0
Show:102550
← PrevPage 71 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified