| Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods | Jan 18, 2020 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | Oct 27, 2024 | Atari GamesDecision Making | —Unverified | 0 |
| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | Oct 7, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States | Oct 1, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| ELSIM: End-to-end learning of reusable skills through intrinsic motivation | Jun 23, 2020 | Developmental LearningMuJoCo | —Unverified | 0 |
| Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning | Jul 4, 2023 | Data AugmentationDiversity | —Unverified | 0 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Benchmarking the Sim-to-Real Gap in Cloth Manipulation | Oct 14, 2023 | BenchmarkingMuJoCo | —Unverified | 0 |
| ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation | Feb 7, 2024 | MuJoCo | —Unverified | 0 |
| EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | Jul 22, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Entropy Augmented Reinforcement Learning | Aug 19, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset GenerationMuJoCo | —Unverified | 0 |
| Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO) | Feb 1, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Episodic Reinforcement Learning with Expanded State-reward Space | Jan 19, 2024 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL | May 14, 2021 | Inductive BiasMeta Reinforcement Learning | —Unverified | 0 |
| Evaluating Robustness of Cooperative MARL | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| DIDA: Denoised Imitation Learning based on Domain Adaptation | Apr 4, 2024 | Domain AdaptationImitation Learning | —Unverified | 0 |
| Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | Jun 20, 2023 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 |
| DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects | Dec 23, 2023 | MuJoCoPosition | —Unverified | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | May 18, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | Feb 17, 2025 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| A Logarithmic Barrier Method For Proximal Policy Optimization | Dec 16, 2018 | MuJoCoReinforcement Learning | —Unverified | 0 |