SOTAVerified

Sequential Decision Making

Papers

Showing 551–600 of 1210 papers

| Title | Status | Hype |
| --- | --- | --- |
| Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization | | 0 |
| Local Differential Privacy for Sequential Decision Making in a Changing Environment | | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | | 0 |
| Risk-Sensitive Policy with Distributional Reinforcement Learning | Code | 1 |
| Quantile Off-Policy Evaluation via Deep Conditional Generative Learning | | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | | 0 |
| Linear Combinatorial Semi-Bandit with Causally Related Rewards | | 0 |
| Online Statistical Inference in Decision-Making with Matrix Context | | 0 |
| Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems | Code | 1 |
| Invariant Lipschitz Bandits: A Side Observation Approach | | 0 |
| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Code | 1 |
| Autoregressive Bandits | Code | 0 |
| Reinforced Approximate Exploratory Data Analysis | | 0 |
| Information-Theoretic Safe Exploration with Gaussian Processes | Code | 0 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | | 0 |
| Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance | | 0 |
| Automaton-Based Representations of Task Knowledge from Generative Language Models | | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | | 0 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Code | 2 |
| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Code | 0 |
| Continuous Episodic Control | | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | | 0 |
| Multi-Environment Pretraining Enables Transfer to Action Limited Datasets | | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | | 0 |
| β-Multivariational Autoencoder for Entangled Representation Learning in Video Frames | Code | 0 |
| A Deep Reinforcement Learning Approach to Rare Event Estimation | | 0 |
| UniMASK: Unified Inference in Sequential Decision Problems | Code | 1 |
| Dynamical Linear Bandits | Code | 0 |
| Agent-State Construction with Auxiliary Inputs | Code | 0 |
| Doubly Inhomogeneous Reinforcement Learning | Code | 0 |
| Learning to Follow Instructions in Text-Based Games | Code | 0 |
| A Survey on Reinforcement Learning in Aviation Applications | | 0 |
| Dungeons and Data: A Large-Scale NetHack Dataset | Code | 2 |
| Teacher-student curriculum learning for reinforcement learning | | 0 |
| HSVI can solve zero-sum Partially Observable Stochastic Games | | 0 |
| Quantum deep recurrent reinforcement learning | | 0 |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Code | 0 |
| Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings | | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | | 0 |
| Fine-Grained Session Recommendations in E-commerce using Deep Reinforcement Learning | | 0 |
| Movement Penalized Bayesian Optimization with Application to Wind Energy Systems | | 0 |
| Markup-to-Image Diffusion Models with Scheduled Sampling | Code | 1 |
| Learning on the Edge: Online Learning with Stochastic Feedback Graphs | | 0 |
| Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | | 0 |
| Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems | | 0 |
| Hyperbolic Deep Reinforcement Learning | | 0 |
| Continuous Monte Carlo Graph Search | Code | 0 |
| Generalizing Bayesian Optimization with Decision-theoretic Entropies | | 0 |
| Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Code | 1 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | | 0 |
Page 12 of 25

No leaderboard results yet.