A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning (Aug 5, 2022). Tags: reinforcement-learning, Reinforcement Learning. Code available.
A Review of Safe Reinforcement Learning: Methods, Theory and Applications (May 20, 2022). Tags: Autonomous Driving, Decision Making. Code available.
ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay (May 22, 2025). Tags: reinforcement-learning, Reinforcement Learning. Code available.
Foundation Policies with Hilbert Representations (Feb 23, 2024). Tags: Reinforcement Learning (RL), Unsupervised Pre-training. Code available.
Smooth Exploration for Robotic Reinforcement Learning (May 12, 2020). Tags: continuous-control, Continuous Control. Code available.
Flightmare: A Flexible Quadrotor Simulator (Sep 1, 2020). Tags: Deep Reinforcement Learning, reinforcement-learning. Code available.
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation (Apr 19, 2024). Tags: Decoder, Network Embedding. Code available.
Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives (Oct 21, 2024). Tags: Reinforcement Learning (RL). Code available.
Assessment of Reinforcement Learning for Macro Placement (Feb 21, 2023). Tags: Deep Reinforcement Learning, reinforcement-learning. Code available.
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges (Nov 12, 2022). Tags: reinforcement-learning, Reinforcement Learning. Code available.
Flow: A Modular Learning Framework for Mixed Autonomy Traffic (Oct 16, 2017). Tags: Autonomous Vehicles, Deep Reinforcement Learning. Code available.
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance (Dec 13, 2021). Tags: Deep Reinforcement Learning, GPU. Code available.
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX (Nov 16, 2023). Tags: CPU, GPU. Code available.
JORLDY: a fully customizable open source framework for reinforcement learning (Apr 11, 2022). Tags: MuJoCo, OpenAI Gym. Code available.
A Critical Evaluation of AI Feedback for Aligning Large Language Models (Feb 19, 2024). Tags: Instruction Following, reinforcement-learning. Code available.
A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning (Dec 12, 2010). Tags: Bayesian Optimization, Hierarchical Reinforcement Learning. Code available.
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods (Mar 25, 2020). Tags: Distributed Computing, Reinforcement Learning. Code available.
Language Models can Solve Computer Tasks (Mar 30, 2023). Tags: Language Modelling, Large Language Model. Code available.
Feedback Efficient Online Fine-Tuning of Diffusion Models (Feb 26, 2024). Tags: reinforcement-learning, Reinforcement Learning. Code available.
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design (Oct 17, 2024). Tags: Protein Design, Reinforcement Learning (RL). Code available.
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Nov 13, 2022). Tags: Diversity, reinforcement-learning. Code available.
Learning Physically Realizable Skills for Online Packing of General 3D Shapes (Dec 5, 2022). Tags: 3D geometry, Action Generation. Code available.
AndroidEnv: A Reinforcement Learning Platform for Android (May 27, 2021). Tags: reinforcement-learning, Reinforcement Learning. Code available.
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 (Mar 31, 2025). Tags: Logical Reasoning, Multiple-choice. Code available.
Learn to Reason Efficiently with Adaptive Length-based Reward Shaping (May 21, 2025). Tags: Reinforcement Learning (RL). Code available.
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking (Apr 2, 2024). Tags: Benchmarking, Reinforcement Learning (RL). Code available.
Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning (Mar 19, 2024). Tags: Inductive Bias, Reinforcement Learning (RL). Code available.
Evolving Reservoirs for Meta Reinforcement Learning (Dec 9, 2023). Tags: Meta Reinforcement Learning, reinforcement-learning. Code available.
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning (Feb 10, 2025). Tags: Math, Mathematical Reasoning. Code available.
Machine Learning in Asset Management—Part 2: Portfolio Construction—Weight Optimization. The Journal of Financial Data Science (Mar 26, 2020). Tags: Articles, Asset Management. Code available.
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization (Oct 11, 2024). Tags: GSM8K, Language Modeling. Code available.
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control (Apr 5, 2021). Tags: Imitation Learning, Reinforcement Learning (RL). Code available.
Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models (Jun 15, 2025). Tags: Reinforcement Learning (RL). Code available.
Emergent Tool Use From Multi-Agent Autocurricula (Sep 17, 2019). Tags: reinforcement-learning, Reinforcement Learning. Code available.
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models (Mar 18, 2025). Tags: Anatomy, Attribute. Code available.
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning (Feb 14, 2025). Tags: Reinforcement Learning (RL), Skills Assessment. Code available.
Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning (Apr 17, 2025). Tags: Multimodal Reasoning, Reinforcement Learning (RL). Code available.
A Comparative Study of Algorithms for Intelligent Traffic Signal Control (Sep 2, 2021). Tags: reinforcement-learning, Reinforcement Learning (RL). Code available.
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization (Sep 2, 2024). Tags: Diversity, Offline RL. Code available.
Efficient World Models with Context-Aware Tokenization (Jun 27, 2024). Tags: Deep Reinforcement Learning, Reinforcement Learning (RL). Code available.
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning (Jun 4, 2021). Tags: counterfactual, Deep Reinforcement Learning. Code available.
MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments (Nov 30, 2022). Tags: Multi-Objective Reinforcement Learning, OpenAI Gym. Code available.
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data (Mar 1, 2024). Tags: continuous-control, Continuous Control. Code available.
DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy (May 16, 2025). Tags: Reinforcement Learning (RL). Code available.
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem (May 30, 2022). Tags: Decision Making, MuJoCo. Code available.
GenRL: Multimodal-foundation world models for generalization in embodied agents (Jun 26, 2024). Tags: Benchmarking, Reinforcement Learning (RL). Code available.
Benchmarking Potential Based Rewards for Learning Humanoid Locomotion (Jul 19, 2023). Tags: Benchmarking, Reinforcement Learning (RL). Code available.
Benchmarking Deep Reinforcement Learning for Continuous Control (Apr 22, 2016). Tags: Action Triplet Recognition, Atari Games. Code available.
Natural Language Reinforcement Learning (Nov 21, 2024). Tags: Decision Making, reinforcement-learning. Code available.
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers (Nov 17, 2024). Tags: In-Context Learning, Meta-Learning. Code available.