| C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Feb 26, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Feb 20, 2024 | Adversarial Attack, MuJoCo | Code Available | 0 |
| Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics | Feb 17, 2024 | MuJoCo, Representation Learning | Code Available | 0 |
| Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains | Feb 9, 2024 | Depth Estimation, MuJoCo | Unverified | 0 |
| ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation | Feb 7, 2024 | MuJoCo | Unverified | 0 |
| Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving | Feb 7, 2024 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Accelerating Inverse Reinforcement Learning with Expert Bootstrapping | Feb 4, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| SQT -- std Q-target | Feb 3, 2024 | MuJoCo, Q-Learning | Unverified | 0 |
| MinMaxMin Q-learning | Feb 3, 2024 | MuJoCo, Q-Learning | Unverified | 0 |
| Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning | Feb 1, 2024 | Imitation Learning, MuJoCo | Code Available | 0 |