| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Careful at Estimation and Bold at Exploration | Aug 22, 2023 | MuJoCo | —Unverified | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-control, Continuous Control | —Unverified | 0 |
| CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | Feb 17, 2025 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 |
| A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells | Apr 22, 2022 | MuJoCo, Open-Ended Question Answering | —Unverified | 0 |
| Formal Language Constrained Markov Decision Processes | Jan 1, 2021 | MuJoCo | —Unverified | 0 |
| Gaussian Process Policy Optimization | Mar 2, 2020 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations | Oct 24, 2023 | Imitation Learning, MuJoCo | —Unverified | 0 |
| CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning | Mar 23, 2025 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 |
| Multiagent Model-based Credit Assignment for Continuous Control | Dec 27, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-control, Continuous Control | —Unverified | 0 |
| Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO) | Feb 1, 2023 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning | May 25, 2024 | Atari Games, AutoML | —Unverified | 0 |
| An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning | Nov 11, 2023 | continuous-control, Continuous Control | —Unverified | 0 |
| Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | Jan 1, 2021 | D4RL, MuJoCo | —Unverified | 0 |
| A Computational Model of Learning Flexible Navigation in a Maze by Layout-Conforming Replay of Place Cells | Sep 18, 2022 | MuJoCo | —Unverified | 0 |
| First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation | Dec 6, 2022 | continuous-control, Continuous Control | —Unverified | 0 |
| Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals | Aug 5, 2020 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 |
| Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States | Oct 1, 2022 | continuous-control, Continuous Control | —Unverified | 0 |
| BlockPuzzle - A Challenge in Physical Reasoning and Generalization for Robot Learning | Nov 30, 2018 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Sim2Sim Evaluation of a Novel Data-Efficient Differentiable Physics Engine for Tensegrity Robots | Nov 10, 2020 | MuJoCo | —Unverified | 0 |
| Adaptive N-step Bootstrapping with Off-policy Data | Jan 1, 2021 | Atari Games, MuJoCo | —Unverified | 0 |
| Biased Estimates of Advantages over Path Ensembles | Sep 15, 2019 | Atari Games, continuous-control | —Unverified | 0 |
| Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble | Dec 7, 2022 | continuous-control, Continuous Control | —Unverified | 0 |
| Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts | Sep 29, 2021 | Autonomous Driving, continuous-control | —Unverified | 0 |
| An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients | Jan 17, 2018 | MuJoCo, Sensitivity | —Unverified | 0 |
| A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem | May 26, 2023 | MuJoCo, Multi-agent Reinforcement Learning | —Unverified | 0 |
| FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | May 28, 2025 | GPU, Humanoid Control | —Unverified | 0 |
| Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration | Jun 25, 2025 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Diverse Imitation Learning via Self-Organizing Generative Models | Sep 29, 2021 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Distributionally Robust Statistical Verification with Imprecise Neural Networks | Aug 28, 2023 | Active Learning, MuJoCo | —Unverified | 0 |
| Fast Convergence of Softmax Policy Mirror Ascent | Nov 18, 2024 | MuJoCo | —Unverified | 0 |
| Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments | Jul 19, 2022 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | Jun 22, 2022 | Autonomous Driving, Classification | —Unverified | 0 |
| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | Nov 14, 2019 | MuJoCo, Vocal Bursts Intensity Prediction | —Unverified | 0 |
| Distributional Decision Transformer for Hindsight Information Matching | Sep 29, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| DisTop: Discovering a Topological representation to learn diverse and rewarding skills | Jun 6, 2021 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | —Unverified | 0 |
| Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning | Jul 4, 2023 | Data Augmentation, Diversity | —Unverified | 0 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-control, Continuous Control | —Unverified | 0 |
| Benchmarking the Sim-to-Real Gap in Cloth Manipulation | Oct 14, 2023 | Benchmarking, MuJoCo | —Unverified | 0 |
| ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation | Feb 7, 2024 | MuJoCo | —Unverified | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCo, Q-Learning | —Unverified | 0 |
| Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning | Apr 2, 2025 | MuJoCo, Uncertainty Quantification | —Unverified | 0 |
| DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes | Aug 1, 2023 | Computational Efficiency, Grasp Generation | —Unverified | 0 |
| Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis | Jun 17, 2022 | MuJoCo, Starcraft | —Unverified | 0 |
| Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | Apr 14, 2019 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset Generation, MuJoCo | —Unverified | 0 |
| DropoutDAgger: A Bayesian Approach to Safe Imitation Learning | Sep 18, 2017 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization | Sep 1, 2022 | Bayesian Inference, Knowledge Distillation | —Unverified | 0 |
| DIDA: Denoised Imitation Learning based on Domain Adaptation | Apr 4, 2024 | Domain Adaptation, Imitation Learning | —Unverified | 0 |