| Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Jun 17, 2025 | Sequential Decision Making | CodeCode Available | 0 | 5 |
| Adversarially Robust Decision Transformer | Jul 25, 2024 | Adversarial RobustnessSequential Decision Making | CodeCode Available | 0 | 5 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement LearningImputation | CodeCode Available | 0 | 5 |
| Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks | Mar 9, 2023 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Feb 12, 2024 | Bayesian OptimizationDecision Making | CodeCode Available | 0 | 5 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Off-Policy Evaluation for Action-Dependent Non-Stationary Environments | Jan 24, 2023 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 | 5 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational EfficiencyDecision Making | CodeCode Available | 0 | 5 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 26, 2017 | Atari GamesDecision Making | CodeCode Available | 0 | 5 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari GamesDecision Making | CodeCode Available | 0 | 5 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| PageRank Bandits for Link Prediction | Nov 3, 2024 | Decision MakingGraph Learning | CodeCode Available | 0 | 5 |
| Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Mar 26, 2025 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 | 5 |
| Planning with Goal-Conditioned Policies | Nov 19, 2019 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning | Jan 20, 2023 | Decision Makingmodel | CodeCode Available | 0 | 5 |
| Continuous Monte Carlo Graph Search | Oct 4, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping | Sep 14, 2017 | Decision MakingImage Cropping | CodeCode Available | 0 | 5 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| DeLF: Designing Learning Environments with Foundation Models | Jan 17, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |