| Phasic Diversity Optimization for Population-Based Reinforcement Learning | Mar 17, 2024 | DiversityMuJoCo | —Unverified | 0 |
| Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning | Mar 11, 2024 | Model Predictive ControlMuJoCo | —Unverified | 0 |
| Conservative DDPG -- Pessimistic RL without Ensemble | Mar 8, 2024 | MuJoCo | —Unverified | 0 |
| Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning | Mar 4, 2024 | Atari Gamescontinuous-control | —Unverified | 0 |
| Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) | Mar 2, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency | Mar 1, 2024 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Feb 26, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Feb 20, 2024 | Adversarial AttackMuJoCo | CodeCode Available | 0 |
| Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics | Feb 17, 2024 | MuJoCoRepresentation Learning | CodeCode Available | 0 |