| Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments | Jun 12, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Learning Embeddings for Sequential Tasks Using Population of Agents | Jun 5, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies | Dec 1, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| β-Multivariational Autoencoder for Entangled Representation Learning in Video Frames | Nov 22, 2022 | Decision MakingObject | CodeCode Available | 0 |
| Best Arm Identification for Stochastic Rising Bandits | Feb 15, 2023 | Decision MakingModel Selection | CodeCode Available | 0 |
| A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Nov 11, 2019 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Collaborative Comic Generation: Integrating Visual Narrative Theories with AI Models for Enhanced Creativity | Sep 25, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 26, 2017 | Atari GamesDecision Making | CodeCode Available | 0 |
| Learning model-based planning from scratch | Jul 19, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Enhancing Heterogeneous Multi-Agent Cooperation in Decentralized MARL via GNN-driven Intrinsic Rewards | Aug 12, 2024 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 0 |