A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning Sep 26, 2023 Benchmarking Multi-Objective Reinforcement Learning
Habitat 2.0: Training Home Assistants to Rearrange their Habitat Jun 28, 2021 Deep Reinforcement Learning GPU
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Jul 8, 2025 MME Reinforcement Learning (RL)
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks Aug 20, 2024 Multi-agent Reinforcement Learning Multi-Task Learning
Flow: A Modular Learning Framework for Mixed Autonomy Traffic Oct 16, 2017 Autonomous Vehicles Deep Reinforcement Learning
FlowReasoner: Reinforcing Query-Level Meta-Agents Apr 21, 2025 Reinforcement Learning (RL)
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance Dec 13, 2021 Deep Reinforcement Learning GPU
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Oct 17, 2024 Protein Design Reinforcement Learning (RL)
In-Hand Object Rotation via Rapid Motor Adaptation Oct 10, 2022 Object Reinforcement Learning (RL)
Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives Oct 21, 2024 Reinforcement Learning (RL)
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning Oct 19, 2024 Benchmarking Multi-agent Reinforcement Learning
Assessment of Reinforcement Learning for Macro Placement Feb 21, 2023 Deep Reinforcement Learning reinforcement-learning
Foundation Policies with Hilbert Representations Feb 23, 2024 Reinforcement Learning (RL) Unsupervised Pre-training
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX Nov 16, 2023 CPU GPU
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Jul 8, 2025 GPU reinforcement-learning
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Feb 10, 2025 Math Mathematical Reasoning
ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay May 22, 2025 reinforcement-learning Reinforcement Learning
Language Models can Solve Computer Tasks Mar 30, 2023 Language Modelling Large Language Model
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Mar 31, 2025 Logical Reasoning Multiple-choice
Feedback Efficient Online Fine-Tuning of Diffusion Models Feb 26, 2024 reinforcement-learning Reinforcement Learning
Learning Heterogeneous Agent Cooperation via Multiagent League Training Nov 13, 2022 Diversity reinforcement-learning
Learning Physically Realizable Skills for Online Packing of General 3D Shapes Dec 5, 2022 3D geometry Action Generation
A Review of Safe Reinforcement Learning: Methods, Theory and Applications May 20, 2022 Autonomous Driving Decision Making
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning Aug 16, 2022 Deep Reinforcement Learning reinforcement-learning
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions Jun 9, 2025 Large Language Model Reinforcement Learning (RL)
AGILE: A Novel Reinforcement Learning Framework of LLM Agents May 23, 2024 Question Answering reinforcement-learning
Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning Mar 19, 2024 Inductive Bias Reinforcement Learning (RL)
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments Mar 13, 2024 Decision Making Language Modeling
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking Apr 2, 2024 Benchmarking Reinforcement Learning (RL)
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization Sep 2, 2024 Diversity Offline RL
Evolving Reservoirs for Meta Reinforcement Learning Dec 9, 2023 Meta Reinforcement Learning reinforcement-learning
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods Mar 25, 2020 Distributed Computing Reinforcement Learning
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation May 22, 2023 Imitation Learning Motion Planning
Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning Apr 17, 2025 Multimodal Reasoning Reinforcement Learning (RL)
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning Dec 11, 2021 Deep Reinforcement Learning GPU
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning Apr 20, 2021 Model-based Reinforcement Learning reinforcement-learning
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning Feb 14, 2025 Reinforcement Learning (RL) Skills Assessment
A Comparative Study of Algorithms for Intelligent Traffic Signal Control Sep 2, 2021 reinforcement-learning Reinforcement Learning (RL)
Emergent Tool Use From Multi-Agent Autocurricula Sep 17, 2019 reinforcement-learning Reinforcement Learning
Efficient World Models with Context-Aware Tokenization Jun 27, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning Jun 4, 2021 counterfactual Deep Reinforcement Learning
MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments Nov 30, 2022 Multi-Objective Reinforcement Learning OpenAI Gym
Efficient Online Reinforcement Learning with Offline Data Feb 6, 2023 reinforcement-learning Reinforcement Learning
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data Mar 1, 2024 continuous-control Continuous Control
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem May 30, 2022 Decision Making MuJoCo
GenRL: Multimodal-foundation world models for generalization in embodied agents Jun 26, 2024 Benchmarking Reinforcement Learning (RL)
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems Feb 23, 2024 Recommendation Systems Reinforcement Learning (RL)
Aligning AI With Shared Human Values Aug 5, 2020 Ethics reinforcement-learning
Benchmarking Deep Reinforcement Learning for Continuous Control Apr 22, 2016 Action Triplet Recognition Atari Games
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Mar 14, 2024 Math Reinforcement Learning (RL)