| Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models | Apr 18, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations | Apr 10, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision MakingFairness | CodeCode Available | 0 |
| Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes | Apr 1, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty | Mar 23, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects | Mar 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| The price of unfairness in linear bandits with biased feedback | Mar 18, 2022 | AttributeDecision Making | —Unverified | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | Mar 11, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| A Trainable Approach to Zero-delay Smoothing Spline Interpolation | Mar 7, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |