| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCoOffline RL | CodeCode Available | 1 |
| FM-TS: Flow Matching for Time Series Generation | Nov 12, 2024 | BenchmarkingImputation | CodeCode Available | 1 |
| Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity | Oct 31, 2024 | MuJoCoQ-Learning | CodeCode Available | 1 |
| Learning Successor Features the Simple Way | Oct 29, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations | Oct 14, 2024 | Dimensionality ReductionMuJoCo | CodeCode Available | 1 |
| Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Oct 11, 2024 | DiversityMuJoCo | CodeCode Available | 1 |
| LLM-Empowered State Representation for Reinforcement Learning | Jul 18, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning | Jul 17, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| RRLS : Robust Reinforcement Learning Suite | Jun 12, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning | Jun 7, 2024 | Contrastive LearningMeta Reinforcement Learning | CodeCode Available | 1 |
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | May 22, 2024 | IngenuityMuJoCo | CodeCode Available | 1 |
| S^2AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic | May 2, 2024 | MuJoCoVariational Inference | CodeCode Available | 1 |
| No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO | May 1, 2024 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | May 1, 2024 | Decision MakingMuJoCo | CodeCode Available | 1 |
| Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference | Feb 7, 2024 | MuJoCo | CodeCode Available | 1 |
| Efficient Reinforcement Learning via Decoupling Exploration and Utilization | Dec 26, 2023 | Autonomous VehiclesMuJoCo | CodeCode Available | 1 |
| World Models via Policy-Guided Trajectory Diffusion | Dec 13, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Optimistic Multi-Agent Policy Gradient | Nov 3, 2023 | MuJoCoQ-Learning | CodeCode Available | 1 |
| Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning | Oct 19, 2023 | MuJoCoPrompt Engineering | CodeCode Available | 1 |
| Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling | Sep 20, 2023 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| A Bayesian Approach to Robust Inverse Reinforcement Learning | Sep 15, 2023 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Jul 21, 2023 | ManagementMuJoCo | CodeCode Available | 1 |
| Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation | Jul 17, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration | May 29, 2023 | MuJoCo | CodeCode Available | 1 |
| Policy Representation via Diffusion Probability Model for Reinforcement Learning | May 22, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |