Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning May 6, 2024 Reinforcement Learning (RL)
Code Code Available 2REBEL: Reinforcement Learning via Regressing Relative Rewards Apr 25, 2024 continuous-control Continuous Control
Code Code Available 2FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation Apr 19, 2024 Decoder Network Embedding
Code Code Available 2Sustainability of Data Center Digital Twins with Reinforcement Learning Apr 16, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 2EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking Apr 2, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 2Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning Mar 19, 2024 Inductive Bias Reinforcement Learning (RL)
Code Code Available 2Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Mar 14, 2024 Math Reinforcement Learning (RL)
Code Code Available 2LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments Mar 13, 2024 Decision Making Language Modeling
Code Code Available 2EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data Mar 1, 2024 continuous-control Continuous Control
Code Code Available 2Curiosity-driven Red-teaming for Large Language Models Feb 29, 2024 Red Teaming Reinforcement Learning (RL)
Code Code Available 2Feedback Efficient Online Fine-Tuning of Diffusion Models Feb 26, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 2GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction Feb 25, 2024 3D Reconstruction Active 3D Reconstruction
Code Code Available 2EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems Feb 23, 2024 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 2Foundation Policies with Hilbert Representations Feb 23, 2024 Reinforcement Learning (RL) Unsupervised Pre-training
Code Code Available 2A Critical Evaluation of AI Feedback for Aligning Large Language Models Feb 19, 2024 Instruction Following reinforcement-learning
Code Code Available 2Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Feb 15, 2024 All Decision Making
Code Code Available 2Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning Feb 8, 2024 GSM8K reinforcement-learning
Code Code Available 2RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback Feb 6, 2024 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 2StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback Feb 2, 2024 Code Completion Code Generation
Code Code Available 2Towards Efficient Exact Optimization of Language Model Alignment Feb 1, 2024 Language Modeling Language Modelling
Code Code Available 2Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management Feb 1, 2024 Deep Reinforcement Learning Management
Code Code Available 2True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning Jan 25, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 2LLMLight: Large Language Models as Traffic Signal Control Agents Dec 26, 2023 Decision Making Management
Code Code Available 2OpenRL: A Unified Reinforcement Learning Framework Dec 20, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 2Evolving Reservoirs for Meta Reinforcement Learning Dec 9, 2023 Meta Reinforcement Learning reinforcement-learning
Code Code Available 2Learning to Fly in Seconds Nov 22, 2023 GPU Reinforcement Learning (RL)
Code Code Available 2JaxMARL: Multi-Agent RL Environments and Algorithms in JAX Nov 16, 2023 CPU GPU
Code Code Available 2Diffusion Models for Reinforcement Learning: A Survey Nov 2, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 2TD-MPC2: Scalable, Robust World Models for Continuous Control Oct 25, 2023 continuous-control Continuous Control
Code Code Available 2Distributional Soft Actor-Critic with Three Refinements Oct 9, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 2RLLTE: Long-Term Evolution Project of Reinforcement Learning Sep 28, 2023 Language Modeling Language Modelling
Code Code Available 2A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning Sep 26, 2023 Benchmarking Multi-Objective Reinforcement Learning
Code Code Available 2Text2Reward: Reward Shaping with Language Models for Reinforcement Learning Sep 20, 2023 MuJoCo reinforcement-learning
Code Code Available 2Natural and Robust Walking using Reinforcement Learning without Demonstrations in High-Dimensional Musculoskeletal Models Sep 6, 2023 Reinforcement Learning (RL)
Code Code Available 2Benchmarking Potential Based Rewards for Learning Humanoid Locomotion Jul 19, 2023 Benchmarking Reinforcement Learning (RL)
Code Code Available 2When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment Jul 7, 2023 Reinforcement Learning (RL)
Code Code Available 2InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback Jun 26, 2023 Benchmarking Code Generation
Code Code Available 2Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX Jun 16, 2023 Decision Making reinforcement-learning
Code Code Available 2Datasets and Benchmarks for Offline Safe Reinforcement Learning Jun 15, 2023 Autonomous Driving Benchmarking
Code Code Available 2QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control Jun 15, 2023 CPU Deep Reinforcement Learning
Code Code Available 2RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control Jun 6, 2023 continuous-control Continuous Control
Code Code Available 2Thought Cloning: Learning to Think while Acting by Imitating Human Thinking Jun 1, 2023 Imitation Learning Reinforcement Learning (RL)
Code Code Available 2Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory May 25, 2023 Common Sense Reasoning CPU
Code Code Available 2FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation May 22, 2023 Imitation Learning Motion Planning
Code Code Available 2DiffMimic: Efficient Motion Mimicking with Differentiable Physics Apr 6, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 2Language Models can Solve Computer Tasks Mar 30, 2023 Language Modelling Large Language Model
Code Code Available 2Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning Mar 29, 2023 GPU reinforcement-learning
Code Code Available 2POPGym: Benchmarking Partially Observable Reinforcement Learning Mar 3, 2023 Benchmarking GPU
Code Code Available 2Reward Design with Language Models Feb 27, 2023 Language Modelling Large Language Model
Code Code Available 2Assessment of Reinforcement Learning for Macro Placement Feb 21, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 2