| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation Learning, MuJoCo | Unverified | 0 |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCo, Q-Learning | Unverified | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Distributionally Robust Statistical Verification with Imprecise Neural Networks | Aug 28, 2023 | Active Learning, MuJoCo | Unverified | 0 |
| Careful at Estimation and Bold at Exploration | Aug 22, 2023 | MuJoCo | Unverified | 0 |
| Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization | Aug 13, 2023 | LEMMA, MuJoCo | Code Available | 0 |
| BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization | Aug 1, 2023 | Bilevel Optimization, Diversity | Code Available | 0 |
| DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes | Aug 1, 2023 | Computational Efficiency, Grasp Generation | Unverified | 0 |
| Variance Control for Distributional Reinforcement Learning | Jul 30, 2023 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 |
| Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning | Jul 24, 2023 | continuous-control, Continuous Control | Unverified | 0 |
| Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs | Jul 21, 2023 | MuJoCo, Representation Learning | Unverified | 0 |
| Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment | Jul 20, 2023 | continuous-control, Continuous Control | Code Available | 0 |
| Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning | Jul 4, 2023 | Data Augmentation, Diversity | Unverified | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| CEIL: Generalized Contextual Imitation Learning | Jun 26, 2023 | D4RL, Imitation Learning | Unverified | 0 |
| Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation | Jun 23, 2023 | Few-Shot Image Classification, Few-Shot Imitation Learning | Code Available | 0 |
| Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | Jun 20, 2023 | Deep Reinforcement Learning, Evolutionary Algorithms | Unverified | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCo, Q-Learning | Unverified | 0 |
| Surfer: Progressive Reasoning with World Models for Robotic Manipulation | Jun 20, 2023 | Decision Making, MuJoCo | Unverified | 0 |
| AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Mimicking Better by Matching the Approximate Action Distribution | Jun 16, 2023 | Imitation Learning, MuJoCo | Code Available | 0 |
| Recurrent Action Transformer with Memory | Jun 15, 2023 | Atari Games, MuJoCo | Code Available | 0 |
| Language to Rewards for Robotic Skill Synthesis | Jun 14, 2023 | In-Context Learning, Logical Reasoning | Unverified | 0 |
| Robust Reinforcement Learning through Efficient Adversarial Herding | Jun 12, 2023 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RL, MuJoCo | Code Available | 0 |