SOTAVerified

Sequential Decision Making

Papers

Showing 951-1000 of 1210 papers

Title | Status | Hype
A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding | - | 0
Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Code | 0
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding | Code | 0
Learning Discrete State Abstractions With Deep Variational Inference | Code | 0
Human AI interaction loop training: New approach for interactive reinforcement learning | - | 0
A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option | - | 0
Distributional Robustness and Regularization in Reinforcement Learning | - | 0
Exploration-Exploitation in Constrained MDPs | - | 0
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Code | 1
Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | - | 0
Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes | Code | 0
Information Directed Sampling for Linear Partial Monitoring | - | 0
Learning Dynamic Belief Graphs to Generalize on Text-Based Games | Code | 1
Online Batch Decision-Making with High-Dimensional Covariates | - | 0
Weakly-supervised Multi-output Regression via Correlated Gaussian Processes | - | 0
Legion: Best-First Concolic Testing | - | 0
PDDLGym: Gym Environments from PDDL Problems | Code | 1
Listwise Learning to Rank with Deep Q-Networks | - | 0
Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints | - | 0
Tight Lower Bounds for Combinatorial Multi-Armed Bandits | - | 0
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic | - | 0
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Code | 1
Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | - | 0
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Code | 1
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | - | 0
Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Code | 0
Fairness in Learning-Based Sequential Decision Algorithms: A Survey | - | 0
Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Code | 0
A storage expansion planning framework using reinforcement learning and simulation-based optimization | - | 0
On Computation and Generalization of Generative Adversarial Imitation Learning | - | 0
Direct and indirect reinforcement learning | - | 0
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances | - | 0
Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR | Code | 0
Maximum Entropy Monte-Carlo Planning | - | 0
SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies | Code | 0
An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | - | 0
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms | - | 0
Planning with Goal-Conditioned Policies | Code | 0
Working Memory Graphs | - | 0
One-shot learning and behavioral eligibility traces in sequential decision making | - | 0
A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Code | 0
Adaptivity in Adaptive Submodularity | - | 0
Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | - | 0
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints | - | 0
Thompson Sampling via Local Uncertainty | Code | 0
Policy Learning for Malaria Control | Code | 0
Adaptive Exploration in Linear Contextual Bandit | - | 0
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Code | 1
MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python | Code | 0
Deep Q-Network for Angry Birds | Code | 0

No leaderboard results yet.