| Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process | Mar 25, 2024 | Active LearningMuJoCo | —Unverified | 0 |
| Robust Model Based Reinforcement Learning Using L_1 Adaptive Control | Mar 21, 2024 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Phasic Diversity Optimization for Population-Based Reinforcement Learning | Mar 17, 2024 | DiversityMuJoCo | —Unverified | 0 |
| A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization | Mar 17, 2024 | MuJoCo | —Unverified | 0 |
| Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning | Mar 11, 2024 | Model Predictive ControlMuJoCo | —Unverified | 0 |
| Conservative DDPG -- Pessimistic RL without Ensemble | Mar 8, 2024 | MuJoCo | —Unverified | 0 |
| Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning | Mar 4, 2024 | Atari Gamescontinuous-control | —Unverified | 0 |
| Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) | Mar 2, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency | Mar 1, 2024 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |