| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Sep 7, 2023 | Decision MakingFairness | CodeCode Available | 0 |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Oct 17, 2023 | counterfactualDecision Making | CodeCode Available | 0 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Apr 26, 2022 | Decision MakingEvolutionary Algorithms | CodeCode Available | 0 |
| Online Learning with Costly Features in Non-stationary Environments | Jul 18, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| SOPE: Spectrum of Off-Policy Estimators | Nov 6, 2021 | Decision MakingOff-policy evaluation | CodeCode Available | 0 |
| Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | May 4, 2022 | Decision MakingGraph Embedding | CodeCode Available | 0 |
| Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Feb 14, 2025 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation | May 20, 2025 | Computational Efficiencycontinuous-control | CodeCode Available | 0 |
| Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability Objectives | Jun 5, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning | May 29, 2020 | Autonomous VehiclesBoard Games | CodeCode Available | 0 |
| TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration | Nov 26, 2023 | Decision MakingDecoder | CodeCode Available | 0 |
| Lifelong Learning with a Changing Action Set | Jun 5, 2019 | Decision MakingLifelong learning | CodeCode Available | 0 |
| TextAtari: 100K Frames Game Playing with Language Agents | Jun 4, 2025 | Atari GamesDecision Making | CodeCode Available | 0 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Adversarially Robust Decision Transformer | Jul 25, 2024 | Adversarial RobustnessSequential Decision Making | CodeCode Available | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | DiagnosticSequential Decision Making | CodeCode Available | 0 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision MakingGPU | CodeCode Available | 0 |
| Universal Off-Policy Evaluation | Apr 26, 2021 | counterfactualDecision Making | CodeCode Available | 0 |
| LISA: Learning Interpretable Skill Abstractions from Language | Feb 28, 2022 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach | Mar 30, 2025 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework | Mar 26, 2025 | Bayesian OptimizationSequential Decision Making | CodeCode Available | 0 |
| Algorithms for Fairness in Sequential Decision Making | Jan 24, 2019 | Decision MakingFairness | CodeCode Available | 0 |
| Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Dec 18, 2023 | Decision MakingKnowledge Graphs | CodeCode Available | 0 |
| Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Jan 3, 2024 | Decision MakingHeuristic Search | CodeCode Available | 0 |
| Adversarial Environment Generation for Learning to Navigate the Web | Mar 2, 2021 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks | Feb 14, 2023 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Data Generation as Sequential Decision Making | Jun 10, 2015 | Decision MakingImputation | CodeCode Available | 0 |
| Long-Term Fair Decision Making through Deep Generative Models | Jan 20, 2024 | Decision MakingFairness | CodeCode Available | 0 |
| Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Jul 10, 2024 | Decision MakingFairness | CodeCode Available | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Toward Policy Explanations for Multi-Agent Reinforcement Learning | Apr 26, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 0 |
| Loss Bounds for Approximate Influence-Based Abstraction | Nov 3, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | ClusteringDecision Making | CodeCode Available | 0 |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Sep 22, 2022 | Bayesian InferenceBayesian Optimisation | CodeCode Available | 0 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal InferenceDecision Making | CodeCode Available | 0 |
| Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Jan 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| SCALES: From Fairness Principles to Constrained Decision-Making | Sep 22, 2022 | Decision MakingFairness | CodeCode Available | 0 |
| MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python | Oct 4, 2019 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | May 22, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | May 20, 2018 | Decision Makingreinforcement-learning | CodeCode Available | 0 |