| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCo, Offline RL | Code Available | 1 |
| Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Nov 11, 2024 | Offline RL | Unverified | 0 |
| OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control | Nov 10, 2024 | Multi-agent Reinforcement Learning, Offline RL | Unverified | 0 |
| Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning | Nov 7, 2024 | Offline RL, Policy Gradient Methods | Unverified | 0 |
| Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions | Nov 1, 2024 | Bayesian Inference, Offline RL | Code Available | 0 |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Oct 30, 2024 | D4RL, Management | Code Available | 0 |
| Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation | Oct 30, 2024 | Offline RL, Q-Learning | Unverified | 0 |
| LongReward: Improving Long-context Large Language Models with AI Feedback | Oct 28, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 2 |
| Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression | Oct 25, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 1 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision Making, Offline RL | Code Available | 0 |
| Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces | Oct 21, 2024 | Continual Learning, Lifelong learning | Unverified | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial Optimization, Deep Learning | Unverified | 0 |
| Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance | Oct 17, 2024 | Offline RL, Re-Ranking | Code Available | 1 |
| Off-dynamics Conditional Diffusion Planners | Oct 16, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning | Oct 15, 2024 | Collision Avoidance, Offline RL | Unverified | 0 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RL, Model-based Reinforcement Learning | Code Available | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision Making, Offline RL | Unverified | 0 |
| Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARC, Decision Making | Unverified | 0 |
| Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm | Oct 13, 2024 | Management, Offline RL | Unverified | 0 |
| Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare | Oct 10, 2024 | Common Sense Reasoning, Data Augmentation | Unverified | 0 |
| The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability | Oct 2, 2024 | Model Predictive Control, Offline RL | Unverified | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0 |
| Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining | Oct 1, 2024 | Atari Games, model | Code Available | 1 |
| DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Sep 26, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| OffRIPP: Offline RL-based Informative Path Planning | Sep 25, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | Sep 24, 2024 | Offline RL, Off-policy evaluation | Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RL, Kolmogorov-Arnold Networks | Unverified | 0 |
| Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Sep 12, 2024 | D4RL, Offline RL | Unverified | 0 |
| Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | Sep 11, 2024 | Offline RL | Unverified | 0 |
| The Role of Deep Learning Regularizations on Actors in Offline RL | Sep 11, 2024 | D4RL, Offline RL | Code Available | 0 |
| Tractable Offline Learning of Regular Decision Processes | Sep 4, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Sep 2, 2024 | Diversity, Offline RL | Code Available | 2 |
| Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | Aug 28, 2024 | Drone navigation, Offline RL | Unverified | 0 |
| Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning | Aug 27, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Unsupervised-to-Online Reinforcement Learning | Aug 27, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RL, Offline RL | Unverified | 0 |
| Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Aug 22, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |
| Domain Adaptation for Offline Reinforcement Learning with Limited Samples | Aug 22, 2024 | Domain Adaptation, Offline RL | Unverified | 0 |
| Preference-Guided Reflective Sampling for Aligning Language Models | Aug 22, 2024 | Document Summarization, Instruction Following | Code Available | 0 |
| Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Aug 20, 2024 | Multi-agent Reinforcement Learning, Multi-Task Learning | Code Available | 2 |
| Offline Model-Based Reinforcement Learning with Anti-Exploration | Aug 20, 2024 | D4RL, model | Unverified | 0 |
| Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba | Aug 20, 2024 | Mamba, Offline RL | Unverified | 0 |
| Enhancing Reinforcement Learning Through Guided Search | Aug 19, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | Aug 16, 2024 | Model-based Reinforcement Learning, Offline RL | Unverified | 0 |
| D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | Aug 15, 2024 | Deep Reinforcement Learning, Offline RL | Unverified | 0 |
| Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Aug 15, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | Aug 8, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | Aug 5, 2024 | Offline RL | Unverified | 0 |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Jul 29, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |