SOTAVerified

Sequential Decision Making

Papers

Showing 351-375 of 1210 papers

Title | Status | Hype
Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search | Code | 0
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback |  | 0
CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System |  | 0
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI |  | 0
AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments |  | 0
Learning Planning Abstractions from Language |  | 0
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows |  | 0
Enhancing Q-Learning with Large Language Model Heuristics |  | 0
MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning |  | 0
Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery |  | 0
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback |  | 0
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks |  | 0
Q-learning with temporal memory to navigate turbulence |  | 0
Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment |  | 0
What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning | Code | 0
Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation |  | 0
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection | Code | 0
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control | Code | 0
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery |  | 0
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Code | 0
Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Code | 0
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment |  | 0
Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks |  | 0
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models |  | 0
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems |  | 0

No leaderboard results yet.