SOTAVerified

Sequential Decision Making

Papers

Showing 601650 of 1210 papers

TitleStatusHype
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning0
Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints0
Linear Combinatorial Semi-Bandit with Causally Related Rewards0
Online Statistical Inference in Decision-Making with Matrix Context0
Invariant Lipschitz Bandits: A Side Observation Approach0
Autoregressive BanditsCode0
Reinforced Approximate Exploratory Data Analysis0
Information-Theoretic Safe Exploration with Gaussian ProcessesCode0
Model-based trajectory stitching for improved behavioural cloning and its applications0
Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance0
Automaton-Based Representations of Task Knowledge from Generative Language Models0
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox0
Is Conditional Generative Modeling all you need for Decision-Making?0
Continuous Episodic Control0
Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy ApproachCode0
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets0
A Deep Reinforcement Learning Approach to Rare Event Estimation0
Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving0
β-Multivariational Autoencoder for Entangled Representation Learning in Video FramesCode0
Dynamical Linear BanditsCode0
Agent-State Construction with Auxiliary InputsCode0
Learning to Follow Instructions in Text-Based GamesCode0
Doubly Inhomogeneous Reinforcement LearningCode0
A Survey on Reinforcement Learning in Aviation Applications0
Teacher-student curriculum learning for reinforcement learning0
HSVI can solve zero-sum Partially Observable Stochastic Games0
Quantum deep recurrent reinforcement learning0
PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear BanditsCode0
Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings0
Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents0
Fine-Grained Session Recommendations in E-commerce using Deep Reinforcement Learning0
Movement Penalized Bayesian Optimization with Application to Wind Energy Systems0
Learning on the Edge: Online Learning with Stochastic Feedback Graphs0
Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop0
Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems0
Hyperbolic Deep Reinforcement Learning0
Continuous Monte Carlo Graph SearchCode0
Generalizing Bayesian Optimization with Decision-theoretic Entropies0
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient0
Modeling driver's evasive behavior during safety-critical lane changes:Two-dimensional time-to-collision and deep reinforcement learning0
Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making0
Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments0
Reinforcement Learning with Non-Exponential Discounting0
On Efficient Online Imitation Learning via Classification0
Deep Reinforcement Learning for Adaptive Mesh Refinement0
Non-monotonic Resource Utilization in the Bandits with Knapsacks ProblemCode0
Graph Neural Networks for Multi-Robot Active Information Acquisition0
SCALES: From Fairness Principles to Constrained Decision-MakingCode0
Batch Bayesian optimisation via density-ratio estimation with guaranteesCode0
Thompson Sampling with Virtual Helping Agents0
Show:102550
← PrevPage 13 of 25Next →

No leaderboard results yet.