Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives Oct 21, 2024 Reinforcement Learning (RL)
Interactive Differentiable Simulation May 26, 2019 Model Predictive Control parameter estimation
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning Jan 25, 2025 Answer Generation Multi-agent Reinforcement Learning
In-Hand Object Rotation via Rapid Motor Adaptation Oct 10, 2022 Object Reinforcement Learning (RL)
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback Jun 26, 2023 Benchmarking Code Generation
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX Jun 16, 2023 Decision Making reinforcement-learning
Learning to Fly in Seconds Nov 22, 2023 GPU Reinforcement Learning (RL)
A Review of Safe Reinforcement Learning: Methods, Theory and Applications May 20, 2022 Autonomous Driving Decision Making
Human-AI Shared Control via Policy Dissection May 31, 2022 Autonomous Driving Reinforcement Learning (RL)
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks Aug 20, 2024 Multi-agent Reinforcement Learning Multi-Task Learning
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Jul 8, 2025 MME Reinforcement Learning (RL)
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning Sep 18, 2022 reinforcement-learning Reinforcement Learning
Habitat 2.0: Training Home Assistants to Rearrange their Habitat Jun 28, 2021 Deep Reinforcement Learning GPU
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts Oct 13, 2022 Atari Games Decision Making
GTA1: GUI Test-time Scaling Agent Jul 8, 2025 Reinforcement Learning (RL) Task Planning
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning Feb 6, 2023 Decision Making reinforcement-learning
Guiding Generative Protein Language Models with Reinforcement Learning Dec 17, 2024 Diversity reinforcement-learning
Heterogeneous Multi-Robot Reinforcement Learning Jan 17, 2023 Graph Neural Network Multi-agent Reinforcement Learning
HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context Jun 26, 2025 Large Language Model Multimodal Reasoning
GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning Apr 3, 2025 Reinforcement Learning (RL)
Godot Reinforcement Learning Agents Dec 7, 2021 CPU reinforcement-learning
Gradient Boosting Reinforcement Learning Jul 11, 2024 GPU reinforcement-learning
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory May 25, 2023 Common Sense Reasoning CPU
Smooth Exploration for Robotic Reinforcement Learning May 12, 2020 continuous-control Continuous Control
Generalized Inner Loop Meta-Learning Oct 3, 2019 Meta-Learning reinforcement-learning
Generative Auto-Bidding with Value-Guided Explorations Apr 20, 2025 Reinforcement Learning (RL)
FlowReasoner: Reinforcing Query-Level Meta-Agents Apr 21, 2025 Reinforcement Learning (RL)
ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay May 22, 2025 reinforcement-learning Reinforcement Learning
Foundation Policies with Hilbert Representations Feb 23, 2024 Reinforcement Learning (RL) Unsupervised Pre-training
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction Feb 25, 2024 3D Reconstruction Active 3D Reconstruction
Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities Jun 22, 2025 Reinforcement Learning (RL)
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement Jul 8, 2024 Language Modeling Language Modelling
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction Oct 29, 2019 Model-based Reinforcement Learning reinforcement-learning
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Oct 17, 2024 Protein Design Reinforcement Learning (RL)
Feedback Efficient Online Fine-Tuning of Diffusion Models Feb 26, 2024 reinforcement-learning Reinforcement Learning
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods Mar 25, 2020 Distributed Computing Reinforcement Learning
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Mar 31, 2025 Logical Reasoning Multiple-choice
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Feb 10, 2025 Math Mathematical Reasoning
Evolving Reservoirs for Meta Reinforcement Learning Dec 9, 2023 Meta Reinforcement Learning reinforcement-learning
AndroidEnv: A Reinforcement Learning Platform for Android May 27, 2021 reinforcement-learning Reinforcement Learning
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance Dec 13, 2021 Deep Reinforcement Learning GPU
Flow: A Modular Learning Framework for Mixed Autonomy Traffic Oct 16, 2017 Autonomous Vehicles Deep Reinforcement Learning
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization Sep 2, 2024 Diversity Offline RL
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization Oct 11, 2024 GSM8K Language Modeling
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation May 22, 2023 Imitation Learning Motion Planning
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning May 19, 2025 Language Modeling Language Modelling
Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models Jun 15, 2025 Reinforcement Learning (RL)
Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning Mar 19, 2024 Inductive Bias Reinforcement Learning (RL)
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning Dec 11, 2021 Deep Reinforcement Learning GPU
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control Apr 5, 2021 Imitation Learning Reinforcement Learning (RL)