| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Reinforced Approximate Exploratory Data Analysis | Dec 12, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Autoregressive Bandits | Dec 12, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Information-Theoretic Safe Exploration with Gaussian Processes | Dec 9, 2022 | Decision MakingGaussian Processes | CodeCode Available | 0 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | Dec 8, 2022 | Behavioural cloningBenchmarking | —Unverified | 0 |
| Automaton-Based Representations of Task Knowledge from Generative Language Models | Dec 4, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance | Dec 4, 2022 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Nov 28, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |