| Guided Exploration with Proximal Policy Optimization using a Single Demonstration | Jul 7, 2020 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Programmable and Customized Intelligence for Traffic Steering in 5G Networks Using Open RAN Architectures | Sep 28, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Defeating Proactive Jammers Using Deep Reinforcement Learning for Resource-Constrained IoT Networks | Jul 13, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism | Jan 3, 2022 | Deep Reinforcement LearningGraph Representation Learning | CodeCode Available | 1 | 5 |
| Large Batch Simulation for Deep Reinforcement Learning | Mar 12, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 | 5 |
| DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems | Oct 8, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers | Oct 5, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Reasoning on a Budget: Miniaturizing DeepSeek R1 with SFT-GRPO Alignment for Instruction-Tuned LLMs | May 16, 2025 | Deep Reinforcement LearningMathematical Reasoning | CodeCode Available | 1 | 5 |
| Developing an OpenAI Gym-compatible framework and simulation environment for testing Deep Reinforcement Learning agents solving the Ambulance Location Problem | Jan 12, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents | Sep 29, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |