| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Offline RL via Feature-Occupancy Gradient Ascent | May 22, 2024 | Offline RL | —Unverified | 0 |
| Efficient Imitation Learning with Conservative World Models | May 21, 2024 | Imitation LearningOffline RL | —Unverified | 0 |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari GamesMamba | CodeCode Available | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RLOffline RL | —Unverified | 0 |
| Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning | May 12, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| LTLDoG: Satisfying Temporally-Extended Symbolic Constraints for Safe Diffusion-based Planning | May 7, 2024 | Offline RLRobot Manipulation | CodeCode Available | 1 |
| Improving Offline Reinforcement Learning with Inaccurate Simulators | May 7, 2024 | D4RLGenerative Adversarial Network | —Unverified | 0 |
| Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows | May 6, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning | May 6, 2024 | Offline RL | —Unverified | 0 |
| Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning | Apr 30, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly | Apr 26, 2024 | Contact-rich ManipulationOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Behavioral Supervisor Tuning | Apr 25, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems | Apr 19, 2024 | Efficient ExplorationMulti-Task Learning | —Unverified | 0 |
| Data-Incremental Continual Offline Reinforcement Learning | Apr 19, 2024 | Continual LearningOffline RL | —Unverified | 0 |
| TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents | Apr 18, 2024 | energy managementOffline RL | CodeCode Available | 0 |
| Offline Trajectory Generalization for Offline Reinforcement Learning | Apr 16, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Apr 15, 2024 | GPUOffline RL | —Unverified | 0 |
| Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains | Apr 11, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Generative Probabilistic Planning for Optimizing Supply Chain Networks | Apr 11, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Apr 6, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | Mar 29, 2024 | counterfactualOffline RL | —Unverified | 0 |
| Scaling Vision-and-Language Navigation With Offline RL | Mar 27, 2024 | Offline RLVision and Language Navigation | —Unverified | 0 |
| Uncertainty-aware Distributional Offline Reinforcement Learning | Mar 26, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling | Mar 25, 2024 | Offline RLRecommendation Systems | —Unverified | 0 |
| The Value of Reward Lookahead in Reinforcement Learning | Mar 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning | Mar 14, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | Mar 9, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| Why Online Reinforcement Learning is Causal | Mar 7, 2024 | counterfactualOffline RL | —Unverified | 0 |
| Offline Fictitious Self-Play for Competitive Games | Feb 29, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings | Feb 27, 2024 | DiversityOffline RL | CodeCode Available | 2 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Feb 21, 2024 | Decision MakingImitation Learning | CodeCode Available | 2 |
| MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 |
| Offline Multi-task Transfer RL with Representational Penalization | Feb 19, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning | Feb 16, 2024 | Metric LearningOffline RL | —Unverified | 0 |
| Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning | Feb 15, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | Feb 12, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Feb 11, 2024 | Offline RL | CodeCode Available | 1 |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Feb 11, 2024 | Distributional Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated LearningOffline RL | —Unverified | 0 |
| Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning | Feb 8, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Actor-Critic Reinforcement Learning Scales to Large Models | Feb 8, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | Feb 7, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RLImitation Learning | CodeCode Available | 1 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive LearningD4RL | —Unverified | 0 |