| LISA: Learning Interpretable Skill Abstractions from Language | Feb 28, 2022 | Decision Making, Imitation Learning | Code Available | 0 |
| Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach | Mar 30, 2025 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework | Mar 26, 2025 | Bayesian Optimization, Sequential Decision Making | Code Available | 0 |
| Algorithms for Fairness in Sequential Decision Making | Jan 24, 2019 | Decision Making, Fairness | Code Available | 0 |
| Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Dec 18, 2023 | Decision Making, Knowledge Graphs | Code Available | 0 |
| Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Jan 3, 2024 | Decision Making, Heuristic Search | Code Available | 0 |
| Adversarial Environment Generation for Learning to Navigate the Web | Mar 2, 2021 | Benchmarking, Decision Making | Code Available | 0 |
| Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks | Feb 14, 2023 | Bayesian Optimization, Decision Making | Code Available | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Data Generation as Sequential Decision Making | Jun 10, 2015 | Decision Making, Imputation | Code Available | 0 |
| Long-Term Fair Decision Making through Deep Generative Models | Jan 20, 2024 | Decision Making, Fairness | Code Available | 0 |
| Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Jul 10, 2024 | Decision Making, Fairness | Code Available | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-control, Continuous Control | Code Available | 0 |
| Toward Policy Explanations for Multi-Agent Reinforcement Learning | Apr 26, 2022 | Autonomous Driving, Decision Making | Code Available | 0 |
| Loss Bounds for Approximate Influence-Based Abstraction | Nov 3, 2020 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | Clustering, Decision Making | Code Available | 0 |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Sep 22, 2022 | Bayesian Inference, Bayesian Optimisation | Code Available | 0 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal Inference, Decision Making | Code Available | 0 |
| Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Jan 13, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| SCALES: From Fairness Principles to Constrained Decision-Making | Sep 22, 2022 | Decision Making, Fairness | Code Available | 0 |
| MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python | Oct 4, 2019 | Decision Making, Sequential Decision Making | Code Available | 0 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | May 22, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | May 20, 2018 | Decision Making, reinforcement-learning | Code Available | 0 |