Run, skeleton, run: skeletal model in a physics-based simulation Nov 18, 2017 Navigate Policy Gradient Methods
Unsupervised Reinforcement Learning in Multiple Environments Dec 16, 2021 reinforcement-learning Reinforcement Learning
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration Jul 15, 2021 Model-based Reinforcement Learning reinforcement-learning
MMaDA: Multimodal Large Diffusion Language Models May 21, 2025 Image Generation Reinforcement Learning (RL)
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review Aug 27, 2022 Deep Reinforcement Learning reinforcement-learning
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning Sep 30, 2022 Data Augmentation Image Generation
Paying Attention to Function Words Sep 24, 2019 reinforcement-learning Reinforcement Learning
Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal May 23, 2019 Decision Making Deep Reinforcement Learning
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning Sep 7, 2019 Deep Reinforcement Learning MuJoCo
The Value of Planning for Infinite-Horizon Model Predictive Control Apr 7, 2021 Model Predictive Control Reinforcement Learning (RL)
StarCraft II: A New Challenge for Reinforcement Learning Aug 16, 2017 Deep Reinforcement Learning Real-Time Strategy Games
Regularization Matters in Policy Optimization Oct 21, 2019 continuous-control Continuous Control
Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics Scenario Sep 25, 2022 Imitation Learning Reinforcement Learning (RL)
Normalization Enhances Generalization in Visual Reinforcement Learning Jun 1, 2023 reinforcement-learning Reinforcement Learning
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning May 26, 2024 Multi-Objective Reinforcement Learning reinforcement-learning
Safe and Efficient Off-Policy Reinforcement Learning Jun 8, 2016 Atari Games reinforcement-learning
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning Apr 3, 2018 Real-Time Strategy Games reinforcement-learning
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms Jul 27, 2022 continuous-control Continuous Control
PathNet: Evolution Channels Gradient Descent in Super Neural Networks Jan 30, 2017 Continual Learning reinforcement-learning
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments Mar 24, 2023 reinforcement-learning Reinforcement Learning
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models Jun 9, 2024 Reinforcement Learning (RL) text-based games
Safe Chance Constrained Reinforcement Learning for Batch Process Control Apr 23, 2021 Gaussian Processes Model Predictive Control
STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs May 21, 2025 Efficient Exploration Reinforcement Learning (RL)
Safe Continuous Control with Constrained Model-Based Policy Optimization Apr 14, 2021 continuous-control Continuous Control
Verifiable and Compositional Reinforcement Learning Systems Jun 7, 2021 reinforcement-learning Reinforcement Learning
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice May 22, 2023 regression Reinforcement Learning (RL)
No Press Diplomacy: Modeling Multi-Agent Gameplay Sep 4, 2019 Reinforcement Learning Reinforcement Learning (RL)
MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning Apr 3, 2025 Reinforcement Learning (RL)
Unsupervised Task Clustering for Multi-Task Reinforcement Learning Jan 1, 2021 Atari Games Clustering
Thinking Fast and Right: Balancing Accuracy and Reasoning Length with Adaptive Rewards May 23, 2025 Reinforcement Learning (RL)
Non-zero-sum Game Control for Multi-vehicle Driving via Reinforcement Learning Feb 8, 2023 Model-based Reinforcement Learning reinforcement-learning
Unsupervised Attention Mechanism across Neural Network Layers Feb 27, 2019 Few-Shot Learning Image Classification
Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning Dec 1, 2019 Model-based Reinforcement Learning Reinforcement Learning
Mixture-of-Variational-Experts for Continual Learning Oct 25, 2021 Continual Learning Domain-IL Continual Learning
Regret Minimization for Reinforcement Learning with Vectorial Feedback and Complex Objectives Dec 1, 2019 reinforcement-learning Reinforcement Learning
Think-J: Learning to Think for Generative LLM-as-a-Judge May 20, 2025 Offline RL Reinforcement Learning (RL)
Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning May 30, 2022 Multiple Instance Learning Reinforcement Learning (RL)
Regret Minimization for Partially Observable Deep Reinforcement Learning Oct 31, 2017 counterfactual Deep Reinforcement Learning
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning May 15, 2021 MuJoCo reinforcement-learning
Safe, Efficient, and Comfortable Velocity Control based on Reinforcement Learning for Autonomous Driving Jan 29, 2019 Autonomous Driving Deep Reinforcement Learning
Nonlinear Inverse Reinforcement Learning with Gaussian Processes Dec 1, 2011 Gaussian Processes reinforcement-learning
Reinforcement learning with non-ergodic reward increments: robustness via ergodicity transformations Oct 17, 2023 Autonomous Driving reinforcement-learning
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning Dec 1, 2021 reinforcement-learning Reinforcement Learning
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models May 30, 2025 Math Multiple-choice
Kernel-Based Reinforcement Learning: A Finite-Time Analysis Apr 12, 2020 reinforcement-learning Reinforcement Learning
Partially Observable Residual Reinforcement Learning for PV-Inverter-Based Voltage Control in Distribution Grids Jun 24, 2025 reinforcement-learning Reinforcement Learning
Park: An Open Platform for Learning-Augmented Computer Systems Dec 1, 2019 reinforcement-learning Reinforcement Learning
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning Oct 4, 2022 Multi-agent Reinforcement Learning reinforcement-learning
State of the Art Control of Atari Games Using Shallow Reinforcement Learning Dec 4, 2015 Atari Games reinforcement-learning
Safe Exploration Method for Reinforcement Learning under Existence of Disturbance Sep 30, 2022 reinforcement-learning Reinforcement Learning