Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation Jun 24, 2021 MuJoCo OpenAI Gym
Code Code Available 2DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning Jun 11, 2021 Card Games Deep Reinforcement Learning
Code Code Available 2Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning Jun 4, 2021 counterfactual Deep Reinforcement Learning
Code Code Available 2AndroidEnv: A Reinforcement Learning Platform for Android May 27, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 2MBRL-Lib: A Modular Library for Model-based Reinforcement Learning Apr 20, 2021 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 2AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control Apr 5, 2021 Imitation Learning Reinforcement Learning (RL)
Code Code Available 2Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control Mar 3, 2021 Benchmarking Multi-agent Reinforcement Learning
Code Code Available 2Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning Dec 16, 2020 Model-based Reinforcement Learning Prediction
Code Code Available 2Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching Dec 16, 2020 Combinatorial Optimization Decision Making
Code Code Available 2Connections between Relational Event Model and Inverse Reinforcement Learning for Characterizing Group Interaction Sequences Oct 19, 2020 BIG-bench Machine Learning Reinforcement Learning (RL)
Code Code Available 2SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving Oct 19, 2020 Autonomous Driving Multi-agent Reinforcement Learning
Code Code Available 2PettingZoo: Gym for Multi-Agent Reinforcement Learning Sep 30, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 2Decoupling Representation Learning from Reinforcement Learning Sep 14, 2020 Data Augmentation Deep Reinforcement Learning
Code Code Available 2DRLE: Decentralized Reinforcement Learning at the Edge for Traffic Light Control in the IoV Sep 3, 2020 Edge-computing Management
Code Code Available 2Flightmare: A Flexible Quadrotor Simulator Sep 1, 2020 Deep Reinforcement Learning reinforcement-learning
Code Code Available 2Aligning AI With Shared Human Values Aug 5, 2020 Ethics reinforcement-learning
Code Code Available 2Smooth Exploration for Robotic Reinforcement Learning May 12, 2020 continuous-control Continuous Control
Code Code Available 2The Variational Bandwidth Bottleneck: Stochastic Evaluation on an Information Budget Apr 24, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 2D4RL: Datasets for Deep Data-Driven Reinforcement Learning Apr 15, 2020 D4RL Offline RL
Code Code Available 2Machine Learning in Asset Management—Part 2: Portfolio Construction—Weight Optimization. The Journal of Financial Data Science Mar 26, 2020 Articles Asset Management
Code Code Available 2Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods Mar 25, 2020 Distributed Computing Reinforcement Learning
Code Code Available 2Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning Mar 19, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 2Neuroevolution of Self-Interpretable Agents Mar 18, 2020 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 2Leveraging Procedural Generation to Benchmark Reinforcement Learning Dec 3, 2019 Procgen Hard (100M) reinforcement-learning
Code Code Available 2Learning to Predict Without Looking Ahead: World Models Without Forward Prediction Oct 29, 2019 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 2Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning Oct 24, 2019 Meta-Learning Meta Reinforcement Learning
Code Code Available 2Generalized Inner Loop Meta-Learning Oct 3, 2019 Meta-Learning reinforcement-learning
Code Code Available 2Emergent Tool Use From Multi-Agent Autocurricula Sep 17, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 2rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch Sep 3, 2019 Deep Reinforcement Learning Q-Learning
Code Code Available 2Interactive Differentiable Simulation May 26, 2019 Model Predictive Control parameter estimation
Code Code Available 2Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles Dec 14, 2018 Autonomous Vehicles Deep Reinforcement Learning
Code Code Available 2Visual Reinforcement Learning with Imagined Goals Jul 12, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 2Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models May 30, 2018 Deep Reinforcement Learning Model-based Reinforcement Learning
Code Code Available 2Accelerated Methods for Deep Reinforcement Learning Mar 7, 2018 Atari Games Deep Reinforcement Learning
Code Code Available 2SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning Nov 13, 2017 Decoder reinforcement-learning
Code Code Available 2Flow: A Modular Learning Framework for Mixed Autonomy Traffic Oct 16, 2017 Autonomous Vehicles Deep Reinforcement Learning
Code Code Available 2Learning through Dialogue Interactions by Asking Questions Dec 15, 2016 reinforcement-learning Reinforcement Learning
Code Code Available 2Dialogue Learning With Human-In-The-Loop Nov 29, 2016 Question Answering reinforcement-learning
Code Code Available 2Benchmarking Deep Reinforcement Learning for Continuous Control Apr 22, 2016 Action Triplet Recognition Atari Games
Code Code Available 2A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning Dec 12, 2010 Bayesian Optimization Hierarchical Reinforcement Learning
Code Code Available 2Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Jul 14, 2025 Math Mathematical Reasoning
Code Code Available 1Deep Reinforcement Learning with Gradient Eligibility Traces Jul 12, 2025 Deep Reinforcement Learning MuJoCo
Code Code Available 1A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning Jul 11, 2025 Math Mathematical Reasoning
Code Code Available 1IRanker: Towards Ranking Foundation Model Jun 25, 2025 GSM8K model
Code Code Available 1KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality Jun 24, 2025 Hallucination Hallucination Evaluation
Code Code Available 1Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning Jun 16, 2025 Multimodal Reasoning Reinforcement Learning (RL)
Code Code Available 1A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints Jun 16, 2025 Job Shop Scheduling Reinforcement Learning (RL)
Code Code Available 1Visual Pre-Training on Unlabeled Images using Reinforcement Learning Jun 13, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 1RePO: Replay-Enhanced Policy Optimization Jun 11, 2025 Math Mathematical Reasoning
Code Code Available 1ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs Jun 11, 2025 Code Generation Diagnostic
Code Code Available 1