Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning Jun 30, 2024 D4RL Offline RL
— Unverified 0DEAR: Disentangled Environment and Agent Representations for Reinforcement Learning without Reconstruction Jun 30, 2024 Reinforcement Learning (RL)
Code Code Available 0Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators Jun 30, 2024 Autonomous Vehicles Offline RL
— Unverified 0Medical Knowledge Integration into Reinforcement Learning Algorithms for Dynamic Treatment Regimes Jun 29, 2024 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A Review of Safe Reinforcement Learning Methods for Modern Power Systems Jun 29, 2024 energy management Reinforcement Learning (RL)
— Unverified 0Digital Twin-Assisted Data-Driven Optimization for Reliable Edge Caching in Wireless Networks Jun 29, 2024 Reinforcement Learning (RL)
— Unverified 0External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling Jun 28, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels Jun 28, 2024 Reinforcement Learning (RL)
— Unverified 0Operator World Models for Reinforcement Learning Jun 28, 2024 Decision Making reinforcement-learning
Code Code Available 0Optimizing Cyber Defense in Dynamic Active Directories through Reinforcement Learning Jun 28, 2024 Blocking Diversity
— Unverified 0Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors Jun 28, 2024 Car Racing Q-Learning
Code Code Available 0Reinforcement Learning for Efficient Design and Control Co-optimisation of Energy Systems Jun 28, 2024 energy management Management
— Unverified 0Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs Jun 28, 2024 Reinforcement Learning (RL)
Code Code Available 0Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs Jun 28, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning Jun 27, 2024 Reinforcement Learning (RL)
— Unverified 0Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion Jun 27, 2024 Code Generation Natural Language Inference
— Unverified 0Multi-agent Cooperative Games Using Belief Map Assisted Training Jun 27, 2024 Reinforcement Learning (RL)
Code Code Available 0Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks Jun 26, 2024 Autonomous Vehicles Decision Making
— Unverified 0Preference Elicitation for Offline Reinforcement Learning Jun 26, 2024 Offline RL reinforcement-learning
— Unverified 0Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control Jun 26, 2024 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Combining Automated Optimisation of Hyperparameters and Reward Shape Jun 26, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0EXTRACT: Efficient Policy Learning by Extracting Transferable Robot Skills from Offline Data Jun 25, 2024 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0Privacy Preserving Reinforcement Learning for Population Processes Jun 25, 2024 Privacy Preserving reinforcement-learning
— Unverified 0Leveraging Reinforcement Learning in Red Teaming for Advanced Ransomware Attack Simulations Jun 25, 2024 Red Teaming Reinforcement Learning (RL)
— Unverified 0ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback Jun 25, 2024 Reinforcement Learning (RL) Sentence
Code Code Available 0Human-Object Interaction from Human-Level Instructions Jun 25, 2024 Common Sense Reasoning Human-Object Interaction Detection
— Unverified 0The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game Jun 25, 2024 Decision Making Imitation Learning
— Unverified 0OCALM: Object-Centric Assessment with Language Models Jun 24, 2024 Object Reinforcement Learning (RL)
— Unverified 0Decentralized RL-Based Data Transmission Scheme for Energy Efficient Harvesting Jun 24, 2024 Reinforcement Learning (RL)
— Unverified 0Confidence Aware Inverse Constrained Reinforcement Learning Jun 24, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Uncertainty-Aware Reward-Free Exploration with General Function Approximation Jun 24, 2024 Reinforcement Learning (RL)
Code Code Available 0KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning Jun 24, 2024 Hierarchical Reinforcement Learning Knowledge Graphs
Code Code Available 0Reinforcement Learning via Auxiliary Task Distillation Jun 24, 2024 Object Rearrangement reinforcement-learning
Code Code Available 0Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems Jun 24, 2024 Autonomous Vehicles Reinforcement Learning (RL)
— Unverified 0Diffusion Spectral Representation for Reinforcement Learning Jun 23, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary Jun 23, 2024 Card Games Reinforcement Learning (RL)
Code Code Available 0Multistep Criticality Search and Power Shaping in Microreactors with Reinforcement Learning Jun 22, 2024 energy management Reinforcement Learning (RL)
— Unverified 0Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models Jun 22, 2024 Reinforcement Learning (RL) SMAC
Code Code Available 0Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning Jun 21, 2024 Reinforcement Learning (RL)
— Unverified 0SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning Jun 21, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertainty Jun 21, 2024 Computational Efficiency Mamba
— Unverified 0Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue Jun 20, 2024 Dialogue State Tracking Reinforcement Learning (RL)
— Unverified 0Optimizing Novelty of Top-k Recommendations using Large Language Models and Reinforcement Learning Jun 20, 2024 Product Recommendation Reinforcement Learning (RL)
— Unverified 0Resource Optimization for Tail-Based Control in Wireless Networked Control Systems Jun 20, 2024 GPR Prediction
— Unverified 0Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing Jun 20, 2024 Autonomous Driving Data Augmentation
— Unverified 0Revealing the learning process in reinforcement learning agents through attention-oriented metrics Jun 20, 2024 Reinforcement Learning (RL)
— Unverified 0Beyond Optimism: Exploration With Partially Observable Rewards Jun 20, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 0Equivariant Offline Reinforcement Learning Jun 20, 2024 Offline RL Q-Learning
— Unverified 0Imagining In-distribution States: How Predictable Robot Behavior Can Enable User Control Over Learned Policies Jun 19, 2024 Reinforcement Learning (RL)
Code Code Available 0Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond Jun 19, 2024 Decision Making reinforcement-learning
— Unverified 0