SOTAVerified

Sequential Decision Making

Papers

Showing 751800 of 1210 papers

TitleStatusHype
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions0
Pessimistic Model Selection for Offline Deep Reinforcement Learning0
Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation0
Neural Column Generation for Capacitated Vehicle Routing0
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability0
Adversarial Deep Learning for Online Resource Allocation0
Deep Reinforcement Learning for Entity Alignment0
Route Optimization via Environment-Aware Deep Network and Reinforcement Learning0
AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive CrossbarsCode0
Automatic Goal Generation using Dynamical Distance Learning0
SOPE: Spectrum of Off-Policy EstimatorsCode0
Regular Decision Processes for Grid Worlds0
Partial-Adaptive Submodular Maximization0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
The Value of Information When Deciding What to Learn0
HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties0
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits0
ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive ModelsCode0
Anti-Concentrated Confidence Bonuses for Scalable Exploration0
Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized RecommendationsCode0
SS-MAIL: Self-Supervised Multi-Agent Imitation Learning0
Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network0
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning0
When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits0
Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations0
Gambits: Theory and Evidence0
Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams0
Decentralized Cross-Entropy Method for Model-Based Reinforcement Learning0
Generalizing Successor Features to continuous domains for Multi-task Learning0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games0
Goal Randomization for Playing Text-based Games without a Reward Function0
PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching0
Neural Bootstrapping Attention for Neural Processes0
Maximizing Ensemble Diversity in Deep Reinforcement Learning0
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey0
Reinforcement Learning for Quantitative Trading0
The f-Divergence Reinforcement Learning Framework0
Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning0
On Optimal Robustness to Adversarial Corruption in Online Decision Problems0
Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time0
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation0
Learning-to-defer for sequential medical decision-making under uncertainty0
Federated Ensemble Model-based Reinforcement Learning in Edge Computing0
Temporal Shift Reinforcement LearningCode0
Optimal Path Planning of Autonomous Marine Vehicles in Stochastic Dynamic Ocean Flows using a GPU-Accelerated Algorithm0
No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees0
Sequential Stochastic Optimization in Separable Learning Environments0
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey0
Improving Human Sequential Decision-Making with Reinforcement Learning0
TDM: Trustworthy Decision-Making via Interpretability Enhancement0
Show:102550
← PrevPage 16 of 25Next →

No leaderboard results yet.