| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation Learning, MuJoCo | Unverified | 0 |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCo, Q-Learning | Unverified | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Distributionally Robust Statistical Verification with Imprecise Neural Networks | Aug 28, 2023 | Active Learning, MuJoCo | Unverified | 0 |
| Careful at Estimation and Bold at Exploration | Aug 22, 2023 | MuJoCo | Unverified | 0 |
| Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization | Aug 13, 2023 | LEMMA, MuJoCo | Code Available | 0 |
| BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization | Aug 1, 2023 | Bilevel Optimization, Diversity | Code Available | 0 |
| DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes | Aug 1, 2023 | Computational Efficiency, Grasp Generation | Unverified | 0 |
| Variance Control for Distributional Reinforcement Learning | Jul 30, 2023 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 |
| Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning | Jul 24, 2023 | continuous-control, Continuous Control | Unverified | 0 |