SOTAVerified

Sequential Decision Making

Papers

Showing 501550 of 1210 papers

TitleStatusHype
Accelerating exploration and representation learning with offline pre-training0
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from ObservationsCode0
Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costsCode0
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey0
Reflexion: Language Agents with Verbal Reinforcement LearningCode4
Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs0
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP0
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning0
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesCode0
Sample-efficient Adversarial Imitation Learning0
Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex NetworksCode0
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards0
Automated Cyber Defence: A Review0
Exploration via Epistemic Value Estimation0
adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems0
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning0
Causal Explanations for Sequential Decision-Making in Multi-Agent SystemsCode0
Minimax-Bayes Reinforcement LearningCode0
Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical SystemsCode0
Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlowCode2
Best Arm Identification for Stochastic Rising BanditsCode0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Effective Dimension in Bandit Problems under Censorship0
Scalable Bayesian optimization with high-dimensional outputs using randomized prior networksCode0
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits0
A Survey on Causal Reinforcement Learning0
Multi-task Representation Learning for Pure Exploration in Linear Bandits0
A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis0
Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications0
A Strong Baseline for Batch Imitation Learning0
Variational Information Pursuit for Interpretable PredictionsCode1
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback0
Learning Universal Policies via Text-Guided Video Generation0
Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule PropagationCode0
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation0
On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures0
Off-Policy Evaluation for Action-Dependent Non-Stationary EnvironmentsCode0
SMART: Self-supervised Multi-task pretrAining with contRol Transformers0
Inducing Point Allocation for Sparse Gaussian Processes in High-Throughput Bayesian Optimisation0
The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision MakingCode0
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation0
Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement LearningCode0
Differential Privacy in Cooperative Multiagent PlanningCode0
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits0
Neuro-Symbolic World Models for Adapting to Open World Novelty0
Neuro-symbolic Meta Reinforcement Learning for Trading0
Fairness and Sequential Decision Making: Limits, Lessons, and Opportunities0
Asynchronous training of quantum reinforcement learning0
Sequential Fair Resource Allocation under a Markov Decision Process Framework0
RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm0
Show:102550
← PrevPage 11 of 25Next →

No leaderboard results yet.