HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding Feb 23, 2024 Imitation Learning Reinforcement Learning (RL)
Code Code Available 1Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning Feb 21, 2024 Cross-Modal Retrieval Image Captioning
Code Code Available 1XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques Feb 20, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Reflect-RL: Two-Player Online RL Fine-Tuning for LMs Feb 20, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Policy Learning for Off-Dynamics RL with Deficient Support Feb 16, 2024 Reinforcement Learning (RL)
Code Code Available 1Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment Feb 15, 2024 GPU Reinforcement Learning (RL)
Code Code Available 1Hybrid Inverse Reinforcement Learning Feb 13, 2024 continuous-control Continuous Control
Code Code Available 1Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks Feb 9, 2024 Graph Neural Network reinforcement-learning
Code Code Available 1Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement Feb 9, 2024 Code Generation Decision Making
Code Code Available 1QGFN: Controllable Greediness with Action Values Feb 7, 2024 Diversity Reinforcement Learning (RL)
Code Code Available 1Safety Filters for Black-Box Dynamical Systems by Learning Discriminating Hyperplanes Feb 7, 2024 Reinforcement Learning (RL)
Code Code Available 1Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Feb 6, 2024 D4RL Offline RL
Code Code Available 1SEABO: A Simple Search-Based Method for Offline Imitation Learning Feb 6, 2024 D4RL Imitation Learning
Code Code Available 1ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update Feb 1, 2024 Imitation Learning Offline RL
Code Code Available 1M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation Jan 30, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning Jan 24, 2024 Question Answering reinforcement-learning
Code Code Available 1DittoGym: Learning to Control Soft Shape-Shifting Robots Jan 24, 2024 Reinforcement Learning (RL)
Code Code Available 1HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments Jan 23, 2024 Common Sense Reasoning Decision Making
Code Code Available 1Stable and Safe Human-aligned Reinforcement Learning through Neural Ordinary Differential Equations Jan 23, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
Code Code Available 1Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning Jan 21, 2024 Reinforcement Learning (RL)
Code Code Available 1Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View Jan 20, 2024 Data Augmentation Reinforcement Learning (RL)
Code Code Available 1Bridging State and History Representations: Understanding Self-Predictive RL Jan 17, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems Jan 17, 2024 Diversity Fairness
Code Code Available 1Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint Jan 11, 2024 Question Answering Reinforcement Learning (RL)
Code Code Available 1Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents Jan 11, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning Jan 1, 2024 Reinforcement Learning (RL)
Code Code Available 1Online Symbolic Music Alignment with Offline Reinforcement Learning Dec 31, 2023 Dynamic Time Warping Offline RL
Code Code Available 1Generalizable Visual Reinforcement Learning with Segment Anything Model Dec 28, 2023 Data Augmentation model
Code Code Available 1PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning Dec 26, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 1Efficient Reinforcement Learning via Decoupling Exploration and Utilization Dec 26, 2023 Autonomous Vehicles MuJoCo
Code Code Available 1Critic-Guided Decision Transformer for Offline Reinforcement Learning Dec 21, 2023 D4RL Offline RL
Code Code Available 1Diffusion Reward: Learning Rewards via Conditional Video Diffusion Dec 21, 2023 Diversity Reinforcement Learning (RL)
Code Code Available 1RFRL Gym: A Reinforcement Learning Testbed for Cognitive Radio Applications Dec 20, 2023 OpenAI Gym reinforcement-learning
Code Code Available 1Challenges for Reinforcement Learning in Quantum Circuit Design Dec 18, 2023 Quantum Machine Learning reinforcement-learning
Code Code Available 1CACTO-SL: Using Sobolev Learning to improve Continuous Actor-Critic with Trajectory Optimization Dec 17, 2023 Reinforcement Learning (RL)
Code Code Available 1Learning to Act without Actions Dec 17, 2023 Reinforcement Learning (RL)
Code Code Available 1Active Reinforcement Learning for Robust Building Control Dec 16, 2023 Atari Games Game of Go
Code Code Available 1World Models via Policy-Guided Trajectory Diffusion Dec 13, 2023 continuous-control Continuous Control
Code Code Available 1The Effective Horizon Explains Deep RL Performance in Stochastic Environments Dec 13, 2023 Reinforcement Learning (RL)
Code Code Available 1Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach Dec 12, 2023 Knowledge Distillation Offline RL
Code Code Available 1Sequential Planning in Large Partially Observable Environments guided by LLMs Dec 12, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 1The Generalization Gap in Offline Reinforcement Learning Dec 10, 2023 Offline RL reinforcement-learning
Code Code Available 1Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator Dec 8, 2023 Model Predictive Control Multi-agent Reinforcement Learning
Code Code Available 1UniTSA: A Universal Reinforcement Learning Framework for V2X Traffic Signal Control Dec 8, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 1Mitigating Open-Vocabulary Caption Hallucinations Dec 6, 2023 Diversity Hallucination
Code Code Available 1Harnessing Discrete Representations For Continual Reinforcement Learning Dec 2, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach Dec 1, 2023 Deep Reinforcement Learning Edge-computing
Code Code Available 1Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms Nov 30, 2023 Benchmarking OpenAI Gym
Code Code Available 1Unveiling the Implicit Toxicity in Large Language Models Nov 29, 2023 Language Modelling Reinforcement Learning (RL)
Code Code Available 1Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents Nov 22, 2023 Decision Making Language Modeling
Code Code Available 1