EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems | Feb 23, 2024 | Recommendation Systems, Reinforcement Learning (RL) | Code available
Feedback Efficient Online Fine-Tuning of Diffusion Models | Feb 26, 2024 | reinforcement-learning, Reinforcement Learning | Code available
AGILE: A Novel Reinforcement Learning Framework of LLM Agents | May 23, 2024 | Question Answering, reinforcement-learning | Code available
A Critical Evaluation of AI Feedback for Aligning Large Language Models | Feb 19, 2024 | Instruction Following, reinforcement-learning | Code available
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue | May 26, 2025 | Diagnostic, Question Answering | Code available
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision | Mar 14, 2024 | Math, Reinforcement Learning (RL) | Code available
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning | Aug 12, 2022 | D4RL, Offline RL | Code available
FlowReasoner: Reinforcing Query-Level Meta-Agents | Apr 21, 2025 | Reinforcement Learning (RL) | Code available
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation | May 22, 2023 | Imitation Learning, Motion Planning | Code available
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | May 19, 2025 | Language Modeling, Language Modelling | Code available
Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving | May 12, 2025 | Math, Mathematical Problem-Solving | Code available
Generative Auto-Bidding with Value-Guided Explorations | Apr 20, 2025 | Reinforcement Learning (RL) | Code available
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense Reasoning, CPU | Code available
Godot Reinforcement Learning Agents | Dec 7, 2021 | CPU, reinforcement-learning | Code available
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | May 25, 2024 | continuous-control, Continuous Control | Code available
Agent models: Internalizing Chain-of-Action Generation into Reasoning models | Mar 9, 2025 | Action Generation, Reinforcement Learning (RL) | Code available
Diffusion Models for Reinforcement Learning: A Survey | Nov 2, 2023 | reinforcement-learning, Reinforcement Learning | Code available
Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities | Jun 22, 2025 | Reinforcement Learning (RL) | Code available
Digi-Q: Learning Q-Value Functions for Training Device-Control Agents | Feb 13, 2025 | Q-Learning, Reinforcement Learning (RL) | Code available
Heterogeneous Multi-Robot Reinforcement Learning | Jan 17, 2023 | Graph Neural Network, Multi-agent Reinforcement Learning | Code available
DiffMimic: Efficient Motion Mimicking with Differentiable Physics | Apr 6, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Code available
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning | Sep 18, 2022 | reinforcement-learning, Reinforcement Learning | Code available
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation | Oct 19, 2022 | Deep Reinforcement Learning, Imitation Learning | Code available
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning | Aug 5, 2022 | reinforcement-learning, Reinforcement Learning | Code available
ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay | May 22, 2025 | reinforcement-learning, Reinforcement Learning | Code available
HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context | Jun 26, 2025 | Large Language Model, Multimodal Reasoning | Code available
Dialogue Learning With Human-In-The-Loop | Nov 29, 2016 | Question Answering, reinforcement-learning | Code available
In-Hand Object Rotation via Rapid Motor Adaptation | Oct 10, 2022 | Object, Reinforcement Learning (RL) | Code available
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Jul 23, 2024 | Autonomous Driving, Autonomous Racing | Code available
Interactive Differentiable Simulation | May 26, 2019 | Model Predictive Control, parameter estimation | Code available
Diffusion Actor-Critic with Entropy Regulator | May 24, 2024 | Decision Making, MuJoCo | Code available
Direct Multi-Turn Preference Optimization for Language Agents | Jun 21, 2024 | Reinforcement Learning (RL) | Code available
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Feb 15, 2024 | All, Decision Making | Code available
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPU, GPU | Code available
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data | Dec 10, 2024 | Offline RL, Reinforcement Learning (RL) | Code available
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Oct 30, 2024 | General Reinforcement Learning, Reinforcement Learning (RL) | Code available
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning | Sep 26, 2023 | Benchmarking, Multi-Objective Reinforcement Learning | Code available
Language Models can Solve Computer Tasks | Mar 30, 2023 | Language Modelling, Large Language Model | Code available
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot | Feb 20, 2023 | Efficient Exploration, reinforcement-learning | Code available
Deep Reinforcement Learning for Multi-Agent Interaction | Aug 2, 2022 | BIG-bench Machine Learning, Causal Inference | Code available
Learning Heterogeneous Agent Cooperation via Multiagent League Training | Nov 13, 2022 | Diversity, reinforcement-learning | Code available
Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometry, Action Generation | Code available
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling | Jan 20, 2025 | Imitation Learning, Language Modeling | Code available
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code available
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Dec 16, 2020 | Combinatorial Optimization, Decision Making | Code available
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Jun 9, 2025 | Large Language Model, Reinforcement Learning (RL) | Code available
DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems | May 30, 2022 | Diversity, reinforcement-learning | Code available
Datasets and Benchmarks for Offline Safe Reinforcement Learning | Jun 15, 2023 | Autonomous Driving, Benchmarking | Code available
D4RL: Datasets for Deep Data-Driven Reinforcement Learning | Apr 15, 2020 | D4RL, Offline RL | Code available
CTR-Driven Advertising Image Generation with Multimodal Large Language Models | Feb 5, 2025 | Image Generation, Reinforcement Learning (RL) | Code available