| Learning Loss Landscapes in Preference Optimization | Nov 10, 2024 | MuJoCo | —Unverified | 0 |
| Scalable Kernel Inverse Optimization | Oct 31, 2024 | MuJoCo | CodeCode Available | 0 |
| Solving Minimum-Cost Reach Avoid using Reinforcement Learning | Oct 29, 2024 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | Oct 27, 2024 | Atari GamesDecision Making | —Unverified | 0 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RLModel-based Reinforcement Learning | CodeCode Available | 0 |
| Neuroplastic Expansion in Deep Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Quality Diversity Imitation Learning | Oct 8, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | Oct 7, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments | Oct 4, 2024 | MuJoCo | —Unverified | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Learning to enhance multi-legged robot on rugged landscapes | Sep 14, 2024 | MuJoCo | —Unverified | 0 |
| Latent Space Energy-based Neural ODEs | Sep 5, 2024 | MuJoCo | —Unverified | 0 |
| Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning | Aug 27, 2024 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective | Aug 19, 2024 | MuJoCo | —Unverified | 0 |
| Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning | Aug 17, 2024 | Density EstimationImitation Learning | —Unverified | 0 |
| Cooperative Multi-Agent Deep Reinforcement Learning in Content Ranking Optimization | Aug 8, 2024 | Deep Reinforcement LearningInformation Retrieval | —Unverified | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | AttributeD4RL | —Unverified | 0 |
| On the Perturbed States for Transformed Input-robust Reinforcement Learning | Jul 31, 2024 | DenoisingMuJoCo | CodeCode Available | 0 |
| SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments | Jul 26, 2024 | MuJoCo | CodeCode Available | 0 |
| Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation | Jul 25, 2024 | MuJoCo | —Unverified | 0 |
| Learning Constraint Network from Demonstrations via Positive-Unlabeled Learning with Memory Replay | Jul 23, 2024 | MuJoCo | —Unverified | 0 |
| Proximal Policy Distillation | Jul 21, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Temporal Abstraction in Reinforcement Learning with Offline Data | Jul 21, 2024 | Hierarchical Reinforcement LearningMuJoCo | —Unverified | 0 |
| Constrained Intrinsic Motivation for Reinforcement Learning | Jul 12, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| A Review of Nine Physics Engines for Reinforcement Learning Research | Jul 11, 2024 | Decision MakingMuJoCo | —Unverified | 0 |