| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Learning Embeddings for Sequential Tasks Using Population of Agents | Jun 5, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Jan 18, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Deep Variational Reinforcement Learning for POMDPs | Jun 6, 2018 | Decision MakingInductive Bias | CodeCode Available | 0 | 5 |
| Autoregressive Bandits | Dec 12, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Reinforcement Learning applied to Insurance Portfolio Pursuit | Aug 1, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |