| The Cell Must Go On: Agar.io for Continual Reinforcement Learning | May 23, 2025 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy | May 18, 2025 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| Reasoning on a Budget: Miniaturizing DeepSeek R1 with SFT-GRPO Alignment for Instruction-Tuned LLMs | May 16, 2025 | Deep Reinforcement LearningMathematical Reasoning | CodeCode Available | 1 |
| Evaluating Robustness of Deep Reinforcement Learning for Autonomous Surface Vehicle Control in Field Tests | May 15, 2025 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | May 8, 2025 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson Disease | Apr 26, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Learning Decision Trees as Amortized Structure Inference | Mar 10, 2025 | Anomaly DetectionDeep Reinforcement Learning | CodeCode Available | 1 |
| Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement Learning | Mar 9, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | Mar 8, 2025 | Deep Reinforcement LearningRepresentation Learning | CodeCode Available | 1 |
| Playing Pokémon Red via Deep Reinforcement Learning | Feb 27, 2025 | Deep Reinforcement LearningLanguage Modeling | CodeCode Available | 1 |