| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 | 5 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Oct 17, 2023 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Oct 16, 2024 | Attributecounterfactual | CodeCode Available | 0 | 5 |
| Learning Discrete State Abstractions With Deep Variational Inference | Mar 9, 2020 | Decision MakingMulti-Goal Reinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action SegmentationClassification | CodeCode Available | 0 | 5 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 | 5 |
| Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Mar 20, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Learning model-based planning from scratch | Jul 19, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Jan 18, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Deep Variational Reinforcement Learning for POMDPs | Jun 6, 2018 | Decision MakingInductive Bias | CodeCode Available | 0 | 5 |
| Autoregressive Bandits | Dec 12, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Co-training for Policy Learning | Jul 3, 2019 | Combinatorial Optimizationcontinuous-control | CodeCode Available | 0 | 5 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 | 5 |
| A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Nov 16, 2020 | Decision MakingEfficient Exploration | CodeCode Available | 0 | 5 |
| DeLF: Designing Learning Environments with Foundation Models | Jan 17, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Back to the Future -- Sequential Alignment of Text Representations | Sep 8, 2019 | Decision MakingRumour Detection | CodeCode Available | 0 | 5 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous VehiclesDecision Making | CodeCode Available | 0 | 5 |