| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision MakingNetHack | —Unverified | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs | Mar 29, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | Mar 23, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs | Mar 18, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP | Mar 16, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning | Mar 15, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision MakingMuJoCo | CodeCode Available | 0 |
| Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks | Mar 9, 2023 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards | Mar 9, 2023 | Decision Makingregression | —Unverified | 0 |
| Automated Cyber Defence: A Review | Mar 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Exploration via Epistemic Value Estimation | Mar 7, 2023 | Decision MakingEfficient Exploration | —Unverified | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Causal Explanations for Sequential Decision-Making in Multi-Agent Systems | Feb 21, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 0 |
| Minimax-Bayes Reinforcement Learning | Feb 21, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Best Arm Identification for Stochastic Rising Bandits | Feb 15, 2023 | Decision MakingModel Selection | CodeCode Available | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Effective Dimension in Bandit Problems under Censorship | Feb 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks | Feb 14, 2023 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | Feb 12, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Survey on Causal Reinforcement Learning | Feb 10, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | Feb 9, 2023 | Decision MakingRepresentation Learning | —Unverified | 0 |