| Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance | Dec 1, 2024 | MuJoCo | —Unverified | 0 |
| Fast Convergence of Softmax Policy Mirror Ascent | Nov 18, 2024 | MuJoCo | —Unverified | 0 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCoOffline RL | CodeCode Available | 1 |
| FM-TS: Flow Matching for Time Series Generation | Nov 12, 2024 | BenchmarkingImputation | CodeCode Available | 1 |
| Multi-Objective Algorithms for Learning Open-Ended Robotic Problems | Nov 11, 2024 | DiversityEvolutionary Algorithms | —Unverified | 0 |
| Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration | Nov 11, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Loss Landscapes in Preference Optimization | Nov 10, 2024 | MuJoCo | —Unverified | 0 |
| Scalable Kernel Inverse Optimization | Oct 31, 2024 | MuJoCo | CodeCode Available | 0 |
| Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity | Oct 31, 2024 | MuJoCoQ-Learning | CodeCode Available | 1 |
| Solving Minimum-Cost Reach Avoid using Reinforcement Learning | Oct 29, 2024 | MuJoCoreinforcement-learning | —Unverified | 0 |