SOTAVerified

Sequential Decision Making

Papers

Showing 951–1000 of 1210 papers

Title | Status | Hype
Reinforcement Learning | Code | 0
Dynamic Bi-Objective Routing of Multiple Vehicles | | 0
Active Measure Reinforcement Learning for Observation Cost Minimization | | 0
Causal Bayesian Optimization | | 0
Implementability of Honest Multi-Agent Sequential Decision-Making with Dynamic Population | | 0
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning | Code | 0
Scalable First-Order Methods for Robust MDPs | | 0
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | | 0
iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots | | 0
Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems | | 0
Sequential Batch Learning in Finite-Action Linear Contextual Bandits | | 0
Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | | 0
A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding | | 0
Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Code | 0
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding | Code | 0
Learning Discrete State Abstractions With Deep Variational Inference | Code | 0
Human AI interaction loop training: New approach for interactive reinforcement learning | | 0
A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option | | 0
Distributional Robustness and Regularization in Reinforcement Learning | | 0
Exploration-Exploitation in Constrained MDPs | | 0
Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | | 0
Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes | Code | 0
Information Directed Sampling for Linear Partial Monitoring | | 0
Online Batch Decision-Making with High-Dimensional Covariates | | 0
Weakly-supervised Multi-output Regression via Correlated Gaussian Processes | | 0
Legion: Best-First Concolic Testing | | 0
Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints | | 0
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic | | 0
Tight Lower Bounds for Combinatorial Multi-Armed Bandits | | 0
Listwise Learning to Rank with Deep Q-Networks | | 0
Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | | 0
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | | 0
Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Code | 0
Fairness in Learning-Based Sequential Decision Algorithms: A Survey | | 0
Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Code | 0
A storage expansion planning framework using reinforcement learning and simulation-based optimization | | 0
On Computation and Generalization of Generative Adversarial Imitation Learning | | 0
Direct and indirect reinforcement learning | | 0
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances | | 0
Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR | Code | 0
Maximum Entropy Monte-Carlo Planning | | 0
SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies | Code | 0
An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | | 0
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms | | 0
Planning with Goal-Conditioned Policies | Code | 0
Working Memory Graphs | | 0
One-shot learning and behavioral eligibility traces in sequential decision making | | 0
A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Code | 0
Adaptivity in Adaptive Submodularity | | 0
Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | | 0
Page 20 of 25

No leaderboard results yet.