| SS-MAIL: Self-Supervised Multi-Agent Imitation Learning | Oct 18, 2021 | Decision Making, Imitation Learning | —Unverified | 0 |
| Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network | Oct 16, 2021 | Behavioural cloning, Decision Making | —Unverified | 0 |
| Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning | Oct 9, 2021 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits | Oct 8, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations | Oct 6, 2021 | Decision Making, Navigate | —Unverified | 0 |
| Gambits: Theory and Evidence | Oct 5, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams | Oct 2, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Decentralized Cross-Entropy Method for Model-Based Reinforcement Learning | Sep 29, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| Generalizing Successor Features to continuous domains for Multi-task Learning | Sep 29, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari Games, Decision Making | —Unverified | 0 |