SOTAVerified

Sequential Decision Making

Papers

Showing 10511100 of 1210 papers

TitleStatusHype
Conservative Contextual Bandits: Beyond Linear Representations0
Constrained Online Decision-Making: A Unified Framework0
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms0
Contextual Bandits for Unbounded Context Distributions0
Contextual Experience Replay for Self-Improvement of Language Agents0
Contextual Online Decision Making with Infinite-Dimensional Functional Regression0
Continual Vision-and-Language Navigation0
Continuous Episodic Control0
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs0
Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning0
Convex Regularization in Monte-Carlo Tree Search0
Cooperative Bayesian Optimization for Imperfect Agents0
CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation0
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation0
Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces0
Counterfactual Inference under Thompson Sampling0
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing0
Counterfactually Guided Off-policy Transfer in Clinical Settings0
Counterfactual Strategies for Markov Decision Processes0
CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games0
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection0
Data-Driven Online Model Selection With Regret Guarantees0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits0
Data-Efficient Reinforcement Learning for Malaria Control0
Data-efficient visuomotor policy training using reinforcement learning and generative models0
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees0
DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation0
Decentralized Cross-Entropy Method for Model-Based Reinforcement Learning0
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances0
Deceptive Sequential Decision-Making via Regularized Policy Optimization0
Deciding What to Learn: A Rate-Distortion Approach0
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning0
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits0
Decision making with dynamic probabilistic forecasts0
Deep Action Sequence Learning for Causal Shape Transformation0
Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time0
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games0
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction0
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking0
Deep Reinforcement Learning for Adaptive Mesh Refinement0
Deep Reinforcement Learning for Entity Alignment0
Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications0
Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks0
Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module0
Deep Reinforcement Learning for Robust Goal-Based Wealth Management0
Selective Network Discovery via Deep Reinforcement Learning on Embedded Spaces0
Deep Reinforcement Learning for Visual Object Tracking in Videos0
Show:102550
← PrevPage 22 of 25Next →

No leaderboard results yet.