Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory May 25, 2023 Common Sense Reasoning CPU
REBEL: Reinforcement Learning via Regressing Relative Rewards Apr 25, 2024 continuous-control Continuous Control
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation May 22, 2023 Imitation Learning Motion Planning
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning May 19, 2025 Language Modeling Language Modelling
FlowReasoner: Reinforcing Query-Level Meta-Agents Apr 21, 2025 Reinforcement Learning (RL)
Foundation Policies with Hilbert Representations Feb 23, 2024 Reinforcement Learning (RL) Unsupervised Pre-training
Generalized Inner Loop Meta-Learning Oct 3, 2019 Meta-Learning reinforcement-learning
Godot Reinforcement Learning Agents Dec 7, 2021 CPU reinforcement-learning
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Jul 8, 2025 MME Reinforcement Learning (RL)
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning Dec 16, 2020 Model-based Reinforcement Learning Prediction
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance Dec 13, 2021 Deep Reinforcement Learning GPU
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Oct 17, 2024 Protein Design Reinforcement Learning (RL)
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation Apr 19, 2024 Decoder Network Embedding
Feedback Efficient Online Fine-Tuning of Diffusion Models Feb 26, 2024 reinforcement-learning Reinforcement Learning
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Mar 31, 2025 Logical Reasoning Multiple-choice
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Feb 10, 2025 Math Mathematical Reasoning
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods Mar 25, 2020 Distributed Computing Reinforcement Learning
Flightmare: A Flexible Quadrotor Simulator Sep 1, 2020 Deep Reinforcement Learning reinforcement-learning
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking Apr 2, 2024 Benchmarking Reinforcement Learning (RL)
Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning Mar 19, 2024 Inductive Bias Reinforcement Learning (RL)
Evolving Reservoirs for Meta Reinforcement Learning Dec 9, 2023 Meta Reinforcement Learning reinforcement-learning
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization Sep 2, 2024 Diversity Offline RL
Emergent Tool Use From Multi-Agent Autocurricula Sep 17, 2019 reinforcement-learning Reinforcement Learning
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control Apr 5, 2021 Imitation Learning Reinforcement Learning (RL)
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data Mar 1, 2024 continuous-control Continuous Control
Efficient Online Reinforcement Learning with Offline Data Feb 6, 2023 reinforcement-learning Reinforcement Learning
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning Dec 11, 2021 Deep Reinforcement Learning GPU
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems Feb 23, 2024 Recommendation Systems Reinforcement Learning (RL)
AndroidEnv: A Reinforcement Learning Platform for Android May 27, 2021 reinforcement-learning Reinforcement Learning
Distributional Soft Actor-Critic with Three Refinements Oct 9, 2023 Decision Making Reinforcement Learning (RL)
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Mar 14, 2024 Math Reinforcement Learning (RL)
Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning Apr 17, 2025 Multimodal Reasoning Reinforcement Learning (RL)
Flow: A Modular Learning Framework for Mixed Autonomy Traffic Oct 16, 2017 Autonomous Vehicles Deep Reinforcement Learning
Direct Multi-Turn Preference Optimization for Language Agents Jun 21, 2024 Reinforcement Learning (RL)
AGILE: A Novel Reinforcement Learning Framework of LLM Agents May 23, 2024 Question Answering reinforcement-learning
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue May 26, 2025 Diagnostic Question Answering
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning Aug 12, 2022 D4RL Offline RL
Diffusion Models for Reinforcement Learning: A Survey Nov 2, 2023 reinforcement-learning Reinforcement Learning
Accelerated Methods for Deep Reinforcement Learning Mar 7, 2018 Atari Games Deep Reinforcement Learning
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation May 12, 2025 Language Modeling Language Modelling
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers Nov 17, 2024 In-Context Learning Meta-Learning
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data Dec 10, 2024 Offline RL Reinforcement Learning (RL)
Efficient World Models with Context-Aware Tokenization Jun 27, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
A Critical Evaluation of AI Feedback for Aligning Large Language Models Feb 19, 2024 Instruction Following reinforcement-learning
Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving May 12, 2025 Math Mathematical Problem-Solving
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization May 25, 2024 continuous-control Continuous Control
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization Oct 11, 2024 GSM8K Language Modeling
Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models Jun 15, 2025 Reinforcement Learning (RL)
Digi-Q: Learning Q-Value Functions for Training Device-Control Agents Feb 13, 2025 Q-Learning Reinforcement Learning (RL)
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation Oct 19, 2022 Deep Reinforcement Learning Imitation Learning