Reinforcement Learning with Maskable Stock Representation for Portfolio Management in Customizable Stock Pools Nov 17, 2023 Management reinforcement-learning
Code Code Available 1Reinforcement Learning with Model Predictive Control for Highway Ramp Metering Nov 15, 2023 Model-based Reinforcement Learning Model Predictive Control
Code Code Available 1Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding Nov 14, 2023 Machine Translation NMT
Code Code Available 1Combinatorial Optimization with Policy Adaptation using Latent Space Search Nov 13, 2023 Benchmarking Combinatorial Optimization
Code Code Available 1Accelerating Exploration with Unlabeled Prior Data Nov 9, 2023 Reinforcement Learning (RL)
Code Code Available 1Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization Nov 6, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1State-Wise Safe Reinforcement Learning With Pixel Observations Nov 3, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation Nov 3, 2023 Reinforcement Learning (RL)
Code Code Available 1Hierarchical Reinforcement Learning for Power Network Topology Control Nov 3, 2023 All Hierarchical Reinforcement Learning
Code Code Available 1Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning Oct 31, 2023 Few-Shot Learning Offline RL
Code Code Available 1DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization Oct 30, 2023 continuous-control Continuous Control
Code Code Available 1Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills Oct 30, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning Oct 27, 2023 D4RL Reinforcement Learning (RL)
Code Code Available 1CROP: Conservative Reward for Model-based Offline Policy Optimization Oct 26, 2023 D4RL Offline RL
Code Code Available 1Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLA Oct 23, 2023 Autonomous Vehicles Deep Reinforcement Learning
Code Code Available 1Diversify Question Generation with Retrieval-Augmented Style Transfer Oct 23, 2023 Diversity Question Answering
Code Code Available 1Contrastive Preference Learning: Learning from Human Feedback without RL Oct 20, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 1Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis Oct 20, 2023 Code Generation Language Modelling
Code Code Available 1Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning Oct 19, 2023 MuJoCo Prompt Engineering
Code Code Available 1Towards Robust Offline Reinforcement Learning under Diverse Data Corruption Oct 19, 2023 Offline RL Q-Learning
Code Code Available 1Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization Oct 18, 2023 Diversity Image Generation
Code Code Available 1AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents Oct 15, 2023 In-Context Learning In-Context Reinforcement Learning
Code Code Available 1Reduced Policy Optimization for Continuous Control with Hard Constraints Oct 14, 2023 continuous-control Continuous Control
Code Code Available 1METRA: Scalable Unsupervised RL with Metric-Aware Abstraction Oct 13, 2023 Reinforcement Learning (RL) Unsupervised Pre-training
Code Code Available 1Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias Oct 12, 2023 D4RL Offline RL
Code Code Available 1Aligning Language Models with Human Preferences via a Bayesian Approach Oct 9, 2023 Contrastive Learning Reinforcement Learning (RL)
Code Code Available 1Safe Deep Policy Adaptation Oct 8, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models Oct 8, 2023 GPU Reinforcement Learning (RL)
Code Code Available 1Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets Oct 6, 2023 D4RL Decision Making
Code Code Available 1Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning Oct 6, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design Oct 4, 2023 Deep Reinforcement Learning General Reinforcement Learning
Code Code Available 1PGDQN: Preference-Guided Deep Q-Network Oct 3, 2023 Atari Games Benchmarking
Code Code Available 1Improving Planning with Large Language Models: A Modular Agentic Architecture Sep 30, 2023 In-Context Learning Reinforcement Learning (RL)
Code Code Available 1Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Sep 29, 2023 Image Generation Offline RL
Code Code Available 1AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback Sep 29, 2023 Common Sense Reasoning Decision Making
Code Code Available 1Motif: Intrinsic Motivation from Artificial Intelligence Feedback Sep 29, 2023 Decision Making Language Modeling
Code Code Available 1Zero-Shot Reinforcement Learning from Low Quality Data Sep 26, 2023 Offline RL reinforcement-learning
Code Code Available 1Recurrent Hypernetworks are Surprisingly Strong in Meta-RL Sep 26, 2023 Deep Reinforcement Learning Few-Shot Learning
Code Code Available 1Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation Sep 25, 2023 Reinforcement Learning (RL)
Code Code Available 1KuaiSim: A Comprehensive Simulator for Recommender Systems Sep 22, 2023 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 1Physics-constrained robust learning of open-form partial differential equations from limited and noisy data Sep 14, 2023 Form Reinforcement Learning (RL)
Code Code Available 1VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning Sep 14, 2023 Offline RL reinforcement-learning
Code Code Available 1Toward Discretization-Consistent Closure Schemes for Large Eddy Simulation Using Reinforcement Learning Sep 12, 2023 Reinforcement Learning (RL)
Code Code Available 1Reasoning with Latent Diffusion in Offline Reinforcement Learning Sep 12, 2023 D4RL Offline RL
Code Code Available 1Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning Aug 28, 2023 Prompt Learning Reinforcement Learning (RL)
Code Code Available 1Language Reward Modulation for Pretraining Reinforcement Learning Aug 23, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning Aug 14, 2023 Few-Shot Learning Reinforcement Learning (RL)
Code Code Available 1Reinforcement Learning for Financial Index Tracking Aug 5, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation Aug 4, 2023 Abstractive Text Summarization Language Modeling
Code Code Available 1qgym: A Gym for Training and Benchmarking RL-Based Quantum Compilation Aug 1, 2023 Benchmarking OpenAI Gym
Code Code Available 1