SOTAVerified

Sequential Decision Making

Papers

Showing 351400 of 1210 papers

TitleStatusHype
Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product SearchCode0
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback0
CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System0
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI0
AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments0
Learning Planning Abstractions from Language0
Enhancing Q-Learning with Large Language Model Heuristics0
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows0
MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning0
Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery0
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback0
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks0
Q-learning with temporal memory to navigate turbulence0
Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment0
What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement LearningCode0
Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation0
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery0
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet ControlCode0
Sequential Decision Making with Expert Demonstrations under Unobserved HeterogeneityCode0
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and DetectionCode0
Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus ErythematosusCode0
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment0
Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks0
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models0
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems0
Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian AdaptationCode0
Continual Vision-and-Language Navigation0
Sequential Decision-Making for Inline Text Autocomplete0
Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion0
Fast Value Tracking for Deep Reinforcement Learning0
Supervised Fine-Tuning as Inverse Reinforcement Learning0
State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards0
Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC0
Regret Minimization via Saddle Point Optimization0
AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents0
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation0
LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem0
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem0
Cooperative Bayesian Optimization for Imperfect Agents0
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation0
Language Guided Exploration for RL Agents in Text Environments0
Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds0
On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games0
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections0
Reward Design for Justifiable Sequential Decision-MakingCode0
Information-Theoretic Safe Bayesian Optimization0
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay0
On the Performance of Empirical Risk Minimization with Smoothed Data0
Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers0
Align Your Intents: Offline Imitation Learning via Optimal Transport0
Show:102550
← PrevPage 8 of 25Next →

No leaderboard results yet.