| Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process | Mar 25, 2024 | Active Learning, MuJoCo | Unverified | 0 |
| Robust Model Based Reinforcement Learning Using L_1 Adaptive Control | Mar 21, 2024 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 |
| Phasic Diversity Optimization for Population-Based Reinforcement Learning | Mar 17, 2024 | Diversity, MuJoCo | Unverified | 0 |
| A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization | Mar 17, 2024 | MuJoCo | Unverified | 0 |
| Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | Continuous Control | Unverified | 0 |
| DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning | Mar 11, 2024 | Model Predictive Control, MuJoCo | Unverified | 0 |
| Conservative DDPG -- Pessimistic RL without Ensemble | Mar 8, 2024 | MuJoCo | Unverified | 0 |
| Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning | Mar 4, 2024 | Atari Games, Continuous Control | Unverified | 0 |
| Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) | Mar 2, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency | Mar 1, 2024 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Feb 26, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Feb 20, 2024 | Adversarial Attack, MuJoCo | Code Available | 0 |
| Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics | Feb 17, 2024 | MuJoCo, Representation Learning | Code Available | 0 |
| Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains | Feb 9, 2024 | Depth Estimation, MuJoCo | Unverified | 0 |
| ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation | Feb 7, 2024 | MuJoCo | Unverified | 0 |
| Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving | Feb 7, 2024 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Accelerating Inverse Reinforcement Learning with Expert Bootstrapping | Feb 4, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| SQT -- std Q-target | Feb 3, 2024 | MuJoCo, Q-Learning | Unverified | 0 |
| MinMaxMin Q-learning | Feb 3, 2024 | MuJoCo, Q-Learning | Unverified | 0 |
| Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning | Feb 1, 2024 | Imitation Learning, MuJoCo | Code Available | 0 |
| A Reinforcement Learning Based Controller to Minimize Forces on the Crutches of a Lower-Limb Exoskeleton | Jan 31, 2024 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | Jan 30, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| Episodic Reinforcement Learning with Expanded State-reward Space | Jan 19, 2024 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| AgentMixer: Multi-Agent Correlated Policy Factorization | Jan 16, 2024 | Imitation Learning, MuJoCo | Unverified | 0 |
| Neural Population Learning beyond Symmetric Zero-sum Games | Jan 10, 2024 | MuJoCo, Transfer Learning | Unverified | 0 |