SOTAVerified

Sequential Decision Making

Papers

Showing 276300 of 1210 papers

TitleStatusHype
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI0
Learning Planning Abstractions from Language0
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows0
Enhancing Q-Learning with Large Language Model Heuristics0
MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning0
Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery0
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback0
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks0
Q-learning with temporal memory to navigate turbulence0
Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment0
What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement LearningCode0
Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation0
Sequential Decision Making with Expert Demonstrations under Unobserved HeterogeneityCode0
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and DetectionCode0
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery0
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet ControlCode0
Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus ErythematosusCode0
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment0
Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks0
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models0
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State SpacesCode1
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems0
Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian AdaptationCode0
Continual Vision-and-Language Navigation0
Sequential Decision-Making for Inline Text Autocomplete0
Show:102550
← PrevPage 12 of 49Next →

No leaderboard results yet.