| On the Design of Safe Continual RL Methods for Control of Nonlinear Systems | Feb 21, 2025 | Continual Learning, MuJoCo | Code Available | 0 |
| CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | Feb 17, 2025 | MuJoCo, Reinforcement Learning (RL) | Unverified | 0 |
| Maximum Entropy Reinforcement Learning with Diffusion Policy | Feb 17, 2025 | Efficient Exploration, MuJoCo | Code Available | 1 |
| A Unifying Framework for Causal Imitation Learning with Hidden Confounders | Feb 11, 2025 | Imitation Learning, MuJoCo | Unverified | 0 |
| Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning | Feb 8, 2025 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset Generation, MuJoCo | Unverified | 0 |
| Task-Aware Virtual Training: Enhancing Generalization in Meta-Reinforcement Learning for Out-of-Distribution Tasks | Feb 5, 2025 | Meta Reinforcement Learning, MuJoCo | Code Available | 0 |
| IRIS: An Immersive Robot Interaction System | Feb 5, 2025 | MuJoCo, Unity | Unverified | 0 |
| On Rollouts in Model-Based Reinforcement Learning | Jan 28, 2025 | model, Model-based Reinforcement Learning | Code Available | 0 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCo, Offline RL | Code Available | 0 |
| Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data | Jan 13, 2025 | Imitation Learning, MuJoCo | Code Available | 0 |
| TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments | Jan 13, 2025 | Decision Making, Meta Reinforcement Learning | Unverified | 0 |
| An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Jan 12, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RL, Diversity | Unverified | 0 |
| Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning | Dec 19, 2024 | Continual Learning, MuJoCo | Unverified | 0 |
| Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning | Dec 19, 2024 | Meta Reinforcement Learning, MuJoCo | Code Available | 0 |
| SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Dec 17, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learning, continuous-control | Unverified | 0 |
| RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors | Dec 14, 2024 | Adversarial Attack, Deep Reinforcement Learning | Unverified | 0 |
| Inverse Delayed Reinforcement Learning | Dec 4, 2024 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance | Dec 1, 2024 | MuJoCo | Unverified | 0 |
| Fast Convergence of Softmax Policy Mirror Ascent | Nov 18, 2024 | MuJoCo | Unverified | 0 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCo, Offline RL | Code Available | 1 |
| FM-TS: Flow Matching for Time Series Generation | Nov 12, 2024 | Benchmarking, Imputation | Code Available | 1 |
| Multi-Objective Algorithms for Learning Open-Ended Robotic Problems | Nov 11, 2024 | Diversity, Evolutionary Algorithms | Unverified | 0 |
| Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration | Nov 11, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Learning Loss Landscapes in Preference Optimization | Nov 10, 2024 | MuJoCo | Unverified | 0 |
| Scalable Kernel Inverse Optimization | Oct 31, 2024 | MuJoCo | Code Available | 0 |
| Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity | Oct 31, 2024 | MuJoCo, Q-Learning | Code Available | 1 |
| Solving Minimum-Cost Reach Avoid using Reinforcement Learning | Oct 29, 2024 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Learning Successor Features the Simple Way | Oct 29, 2024 | Continual Learning, Deep Reinforcement Learning | Code Available | 1 |
| Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | Oct 27, 2024 | Atari Games, Decision Making | Unverified | 0 |
| Streaming Deep Reinforcement Learning Finally Works | Oct 18, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 3 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RL, Model-based Reinforcement Learning | Code Available | 0 |
| Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Oct 15, 2024 | Disentanglement, Inductive Bias | Code Available | 2 |
| Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations | Oct 14, 2024 | Dimensionality Reduction, MuJoCo | Code Available | 1 |
| Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Oct 11, 2024 | Diversity, MuJoCo | Code Available | 1 |
| Neuroplastic Expansion in Deep Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Quality Diversity Imitation Learning | Oct 8, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | Oct 7, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments | Oct 4, 2024 | MuJoCo | Unverified | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0 |
| Learning to enhance multi-legged robot on rugged landscapes | Sep 14, 2024 | MuJoCo | Unverified | 0 |
| Latent Space Energy-based Neural ODEs | Sep 5, 2024 | MuJoCo | Unverified | 0 |
| Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning | Aug 27, 2024 | MuJoCo, Reinforcement Learning (RL) | Unverified | 0 |
| The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective | Aug 19, 2024 | MuJoCo | Unverified | 0 |
| Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning | Aug 17, 2024 | Density Estimation, Imitation Learning | Unverified | 0 |
| Cooperative Multi-Agent Deep Reinforcement Learning in Content Ranking Optimization | Aug 8, 2024 | Deep Reinforcement Learning, Information Retrieval | Unverified | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | Attribute, D4RL | Unverified | 0 |
| MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench | Aug 1, 2024 | Humanoid Control, MuJoCo | Code Available | 5 |