StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models Oct 10, 2024 Question Answering Reinforcement Learning (RL)
Retrieval-Augmented Decision Transformer: External Memory for In-context RL Oct 9, 2024 In-Context Learning Reinforcement Learning (RL)
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Oct 8, 2024 GSM8K Multi-agent Reinforcement Learning
GreenLight-Gym: Reinforcement learning benchmark environment for control of greenhouse production systems Oct 6, 2024 Numerical Integration Reinforcement Learning (RL)
Predictive Coding for Decision Transformer Oct 4, 2024 Decision Making Reinforcement Learning (RL)
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization Oct 4, 2024 Deep Reinforcement Learning Quantization
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI Oct 3, 2024 Few-Shot Imitation Learning Imitation Learning
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining Oct 1, 2024 Atari Games model
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models Sep 27, 2024 Reinforcement Learning (RL) World Knowledge
ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning Sep 27, 2024 AutoML Benchmarking
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors Sep 26, 2024 continuous-control Continuous Control
Reinforcement Learning-based Model Predictive Control for Greenhouse Climate Control Sep 19, 2024 Model Predictive Control Prediction
Leveraging Symmetry to Accelerate Learning of Trajectory Tracking Controllers for Free-Flying Robotic Systems Sep 17, 2024 Reinforcement Learning (RL)
Enhancing RL Safety with Counterfactual LLM Reasoning Sep 16, 2024 counterfactual Language Modeling
AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models Sep 13, 2024 Reinforcement Learning (RL)
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control Aug 30, 2024 Model-based Reinforcement Learning Reinforcement Learning (RL)
What makes math problems hard for reinforcement learning: a case study Aug 27, 2024 Math Reinforcement Learning (RL)
Control-Informed Reinforcement Learning for Chemical Processes Aug 24, 2024 Deep Reinforcement Learning reinforcement-learning
Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization Aug 16, 2024 Decision Making reinforcement-learning
Fine-tuning LLMs for Autonomous Spacecraft Control: A Case Study Using Kerbal Space Program Aug 16, 2024 Reinforcement Learning (RL)
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection Aug 13, 2024 Deep Reinforcement Learning Object
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning Aug 8, 2024 reinforcement-learning Reinforcement Learning
Model-Based Transfer Learning for Contextual Reinforcement Learning Aug 8, 2024 Bayesian Optimization continuous-control
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning Aug 6, 2024 Combinatorial Optimization Graph Neural Network
Visual Grounding for Object-Level Generalization in Reinforcement Learning Aug 4, 2024 Language Modelling Object
Collision Probability Distribution Estimation via Temporal Difference Learning Jul 29, 2024 AI Agent Autonomous Driving
Reinforcement Learning Pair Trading: A Dynamic Scaling approach Jul 23, 2024 Algorithmic Trading Decision Making
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning Jul 19, 2024 reinforcement-learning Reinforcement Learning
Learning Goal-Conditioned Representations for Language Reward Models Jul 18, 2024 GSM8K Math
Variable-Agnostic Causal Exploration for Reinforcement Learning Jul 17, 2024 Causal Discovery reinforcement-learning
Chip Placement with Diffusion Models Jul 17, 2024 Dataset Generation Denoising
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning Jul 17, 2024 MuJoCo reinforcement-learning
Reinforcement Learning in High-frequency Market Making Jul 14, 2024 Q-Learning reinforcement-learning
A Benchmark Environment for Offline Reinforcement Learning in Racing Games Jul 12, 2024 reinforcement-learning Reinforcement Learning
Transductive Active Learning with Application to Safe Bayesian Optimization Jul 12, 2024 Active Learning Bayesian Optimization
Can Learned Optimization Make Reinforcement Learning Less Difficult? Jul 9, 2024 Decision Making Meta-Learning
Stranger Danger! Identifying and Avoiding Unpredictable Pedestrians in RL-based Social Robot Navigation Jul 8, 2024 Reinforcement Learning (RL) Robot Navigation
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning Jul 5, 2024 reinforcement-learning Reinforcement Learning
RobocupGym: A challenging continuous control benchmark in Robocup Jul 3, 2024 Board Games continuous-control
PUZZLES: A Benchmark for Neural Algorithmic Reasoning Jun 29, 2024 Decision Making Logical Reasoning
Memory-Enhanced Neural Solvers for Efficient Adaptation in Combinatorial Optimization Jun 24, 2024 Combinatorial Optimization Reinforcement Learning (RL)
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization Jun 20, 2024 Multi-agent Reinforcement Learning reinforcement-learning
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold Jun 20, 2024 Math Reinforcement Learning (RL)
Discovering Minimal Reinforcement Learning Environments Jun 18, 2024 continuous-control Continuous Control
Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning Jun 10, 2024 Atari Games Reinforcement Learning (RL)
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data Jun 9, 2024 Benchmarking Management
HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning Jun 6, 2024 reinforcement-learning Reinforcement Learning
Strategically Conservative Q-Learning Jun 6, 2024 D4RL Offline RL
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning Jun 5, 2024 Quantization Reinforcement Learning (RL)
CommonPower: A Framework for Safe Data-Driven Smart Grid Control Jun 5, 2024 Benchmarking energy management