Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space Alignment Jun 4, 2024 Decoder Reinforcement Learning (RL)
Code Code Available 1Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning Jun 4, 2024 Mamba OpenAI Gym
Code Code Available 1FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning Jun 2, 2024 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 1SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems Jun 1, 2024 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 1In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought May 31, 2024 D4RL Decision Making
Code Code Available 1Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning May 31, 2024 D4RL Reinforcement Learning (RL)
Code Code Available 1Diffusion Policies creating a Trust Region for Offline Reinforcement Learning May 30, 2024 D4RL Denoising
Code Code Available 1DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime May 28, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 1Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL May 28, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 1Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination May 28, 2024 Offline RL reinforcement-learning
Code Code Available 1DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems May 27, 2024 Reinforcement Learning (RL)
Code Code Available 1Rethinking Transformers in Solving POMDPs May 27, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Q-value Regularized Transformer for Offline Reinforcement Learning May 27, 2024 D4RL Offline RL
Code Code Available 1Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization May 26, 2024 Reinforcement Learning (RL)
Code Code Available 1Cross-Domain Policy Adaptation by Capturing Representation Mismatch May 24, 2024 Reinforcement Learning (RL) Representation Learning
Code Code Available 1Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search May 24, 2024 Code Generation Language Modelling
Code Code Available 1Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate May 24, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Multi-turn Reinforcement Learning from Preference Human Feedback May 23, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning May 23, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow May 22, 2024 Ingenuity MuJoCo
Code Code Available 1Knowledge Graph Reasoning with Self-supervised Reinforcement Learning May 22, 2024 Knowledge Graphs reinforcement-learning
Code Code Available 1CausalPlayground: Addressing Data-Generation Requirements in Cutting-Edge Causality Research May 21, 2024 Reinforcement Learning (RL)
Code Code Available 1Feasibility Consistent Representation Learning for Safe Reinforcement Learning May 20, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1Reinformer: Max-Return Sequence Modeling for Offline RL May 14, 2024 D4RL Offline RL
Code Code Available 1Value Augmented Sampling for Language Model Alignment and Personalization May 10, 2024 Language Modeling Language Modelling
Code Code Available 1Human-centric Reward Optimization for Reinforcement Learning-based Automated Driving using Large Language Models May 7, 2024 In-Context Learning Reinforcement Learning (RL)
Code Code Available 1Simulating the Economic Impact of Rationality through Reinforcement Learning and Agent-Based Modelling May 3, 2024 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO May 1, 2024 MuJoCo Reinforcement Learning (RL)
Code Code Available 1Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning Apr 30, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 1A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning Apr 24, 2024 Deep Reinforcement Learning energy management
Code Code Available 1Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data Apr 22, 2024 Contrastive Learning Reinforcement Learning (RL)
Code Code Available 1WROOM: An Autonomous Driving Approach for Off-Road Navigation Apr 12, 2024 Autonomous Driving Reinforcement Learning (RL)
Code Code Available 1Dataset Reset Policy Optimization for RLHF Apr 12, 2024 Reinforcement Learning (RL)
Code Code Available 1How Consistent are Clinicians? Evaluating the Predictability of Sepsis Disease Progression with Dynamics Models Apr 10, 2024 Diversity Reinforcement Learning (RL)
Code Code Available 1Electric Vehicle Routing Problem for Emergency Power Supply: Towards Telecom Base Station Relief Apr 3, 2024 Reinforcement Learning (RL)
Code Code Available 1Entity-Centric Reinforcement Learning for Object Manipulation from Pixels Apr 1, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1The New Agronomists: Language Models are Experts in Crop Management Mar 28, 2024 Language Modelling Management
Code Code Available 1TractOracle: towards an anatomically-informed reward function for RL-based tractography Mar 26, 2024 Reinforcement Learning (RL)
Code Code Available 1Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems Mar 26, 2024 Bilevel Optimization Model Predictive Control
Code Code Available 1PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning Mar 26, 2024 Deep Reinforcement Learning Distributed Computing
Code Code Available 1Policy Bifurcation in Safe Reinforcement Learning Mar 19, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning Mar 19, 2024 Reinforcement Learning (RL) Visual Grounding
Code Code Available 1Reinforcement Learning with Token-level Feedback for Controllable Text Generation Mar 18, 2024 Attribute reinforcement-learning
Code Code Available 1Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial Games Mar 16, 2024 Autonomous Navigation Efficient Exploration
Code Code Available 1Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems Mar 6, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1SplAgger: Split Aggregation for Meta-Reinforcement Learning Mar 5, 2024 continuous-control Continuous Control
Code Code Available 1Improving the Validity of Automatically Generated Feedback via Reinforcement Learning Mar 2, 2024 Math Misconceptions
Code Code Available 1Large Language Models are Learnable Planners for Long-Term Recommendation Feb 29, 2024 Decision Making Language Modelling
Code Code Available 1Flexible Robust Beamforming for Multibeam Satellite Downlink using Reinforcement Learning Feb 26, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1How Can LLM Guide RL? A Value-Based Approach Feb 25, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1