| Energy-Weighted Flow Matching for Offline Reinforcement Learning | Mar 6, 2025 | Offline RLreinforcement-learning | —Unverified | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement LearningMuJoCo | —Unverified | 0 |
| Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective | Feb 17, 2025 | Bayesian Optimizationmodel | —Unverified | 0 |
| Which Features are Best for Successor Features? | Feb 15, 2025 | Offline RL | —Unverified | 0 |
| Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | Feb 13, 2025 | D4RLOffline RL | —Unverified | 0 |
| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Feb 11, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits | Feb 7, 2025 | InformativenessOffline RL | —Unverified | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset GenerationMuJoCo | —Unverified | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | Feb 5, 2025 | Few-Shot LearningImitation Learning | —Unverified | 0 |
| Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation | Feb 4, 2025 | feature selectionOffline RL | —Unverified | 0 |
| Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning | Feb 3, 2025 | Meta-LearningOffline RL | —Unverified | 0 |
| Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback | Jan 27, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Data Center Cooling System Optimization Using Offline Reinforcement Learning | Jan 25, 2025 | Graph Neural NetworkOffline RL | —Unverified | 0 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCoOffline RL | CodeCode Available | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DRDT3: Diffusion-Refined Decision Test-Time Training Model | Jan 12, 2025 | D4RLOffline RL | —Unverified | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RLImitation Learning | —Unverified | 0 |
| On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures | Jan 3, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Dec 29, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL | Dec 25, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization | Dec 24, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| AdaCred: Adaptive Causal Decision Transformers with Feature Crediting | Dec 19, 2024 | AttributeImitation Learning | —Unverified | 0 |
| Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Dec 11, 2024 | Autonomous DrivingOffline RL | CodeCode Available | 0 |
| Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone | Dec 9, 2024 | global-optimizationImitation Learning | —Unverified | 0 |
| Reinforcement Learning: An Overview | Dec 6, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RLOffline RL | —Unverified | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Dec 3, 2024 | ObjectOffline RL | —Unverified | 0 |
| Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization | Nov 27, 2024 | Computational EfficiencyOffline RL | —Unverified | 0 |
| LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Nov 26, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement | Nov 26, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Continual Task Learning through Adaptive Policy Self-Composition | Nov 18, 2024 | Continual LearningOffline RL | CodeCode Available | 0 |
| Preserving Expert-Level Privacy in Offline Reinforcement Learning | Nov 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning | Nov 12, 2024 | Imitation LearningOffline RL | —Unverified | 0 |
| Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Nov 11, 2024 | Offline RL | —Unverified | 0 |
| OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control | Nov 10, 2024 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning | Nov 7, 2024 | Offline RLPolicy Gradient Methods | —Unverified | 0 |
| Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions | Nov 1, 2024 | Bayesian InferenceOffline RL | CodeCode Available | 0 |
| Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation | Oct 30, 2024 | Offline RLQ-Learning | —Unverified | 0 |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Oct 30, 2024 | D4RLManagement | CodeCode Available | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 |
| Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces | Oct 21, 2024 | Continual LearningLifelong learning | —Unverified | 0 |
| Off-dynamics Conditional Diffusion Planners | Oct 16, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RLModel-based Reinforcement Learning | CodeCode Available | 0 |
| Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARCDecision Making | —Unverified | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning | Oct 15, 2024 | Collision AvoidanceOffline RL | —Unverified | 0 |
| Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm | Oct 13, 2024 | ManagementOffline RL | —Unverified | 0 |