| Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | Mar 8, 2025 | Deep Reinforcement LearningRepresentation Learning | CodeCode Available | 1 |
| ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning | Mar 8, 2025 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning | Mar 7, 2025 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation | Mar 7, 2025 | Deep Reinforcement LearningOut-of-Distribution Detection | —Unverified | 0 |
| Can We Optimize Deep RL Policy Weights as Trajectory Modeling? | Mar 6, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services | Mar 6, 2025 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| PALo: Learning Posture-Aware Locomotion for Quadruped Robots | Mar 6, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Multi-Agent Inverse Q-Learning from Demonstrations | Mar 6, 2025 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Less is more? Rewards in RL for Cyber Defence | Mar 5, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Koopman-Based Generalization of Deep Reinforcement Learning With Application to Wireless Communications | Mar 4, 2025 | Deep Reinforcement Learning | —Unverified | 0 |