SOTAVerified

Sequential Decision Making

Papers

Showing 551575 of 1210 papers

TitleStatusHype
Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization0
Local Differential Privacy for Sequential Decision Making in a Changing Environment0
Risk-Sensitive Policy with Distributional Reinforcement LearningCode1
Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent0
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning0
Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints0
Linear Combinatorial Semi-Bandit with Causally Related Rewards0
Online Statistical Inference in Decision-Making with Matrix Context0
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systemsCode1
Invariant Lipschitz Bandits: A Side Observation Approach0
Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand SystemsCode1
Reinforced Approximate Exploratory Data Analysis0
Autoregressive BanditsCode0
Information-Theoretic Safe Exploration with Gaussian ProcessesCode0
Model-based trajectory stitching for improved behavioural cloning and its applications0
Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance0
Automaton-Based Representations of Task Knowledge from Generative Language Models0
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox0
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy ApproachCode0
Continuous Episodic Control0
Is Conditional Generative Modeling all you need for Decision-Making?0
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets0
Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving0
β-Multivariational Autoencoder for Entangled Representation Learning in Video FramesCode0
Show:102550
← PrevPage 23 of 49Next →

No leaderboard results yet.