IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Feb 5, 2018 Atari Games reinforcement-learning
Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO May 25, 2020 Deep Reinforcement Learning reinforcement-learning
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs Jul 5, 2022 Fairness reinforcement-learning
Implicit Distributional Reinforcement Learning Jul 13, 2020 Distributional Reinforcement Learning OpenAI Gym
Accelerating Reinforcement Learning with Learned Skill Priors Oct 22, 2020 reinforcement-learning Reinforcement Learning
Improving and Benchmarking Offline Reinforcement Learning Algorithms Jun 1, 2023 Attribute Benchmarking
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay Jun 5, 2025 Reinforcement Learning (RL)
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture May 28, 2021 Meta Reinforcement Learning MuJoCo
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision Feb 10, 2021 Board Games Model-based Reinforcement Learning
Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMs Feb 23, 2023 reinforcement-learning Reinforcement Learning (RL)
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? Sep 26, 2019 Feature Engineering Q-Learning
Goal-directed graph construction using reinforcement learning Jan 30, 2020 Decision Making graph construction
Actor-Critic Reinforcement Learning for Control with Stability Guarantee Apr 29, 2020 Motion Planning reinforcement-learning
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought May 31, 2024 D4RL Decision Making
A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning Oct 15, 2020 Management Multi-agent Reinforcement Learning
In Defense of the Unitary Scalarization for Deep Multi-Task Learning Jan 11, 2022 Multi-Task Learning Reinforcement Learning (RL)
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization Jun 12, 2021 Atari Games MuJoCo
A simple but strong baseline for online continual learning: Repeated Augmented Rehearsal Sep 28, 2022 Continual Learning Reinforcement Learning (RL)
CommonPower: A Framework for Safe Data-Driven Smart Grid Control Jun 5, 2024 Benchmarking energy management
Information Directed Reward Learning for Reinforcement Learning Feb 24, 2021 reinforcement-learning Reinforcement Learning
Comparing Popular Simulation Environments in the Scope of Robotics and Reinforcement Learning Mar 8, 2021 CPU reinforcement-learning
Concise Reasoning via Reinforcement Learning Apr 7, 2025 reinforcement-learning Reinforcement Learning
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach Nov 14, 2021 Algorithmic Trading General Reinforcement Learning
Intention-Conditioned Flow Occupancy Models Jun 10, 2025 Reinforcement Learning (RL)
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Sep 29, 2023 Image Generation Offline RL
Interactive Machine Learning of Musical Gesture Nov 26, 2020 BIG-bench Machine Learning Reinforcement Learning (RL)
Interferobot: aligning an optical interferometer by a reinforcement learning agent Jun 3, 2020 Deep Reinforcement Learning reinforcement-learning
Learning to combine primitive skills: A step towards versatile robotic manipulation Aug 2, 2019 Data Augmentation Imitation Learning
Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach Dec 1, 2023 Deep Reinforcement Learning Edge-computing
A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation Apr 14, 2020 Deep Reinforcement Learning Interactive Recommendation
A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems Feb 9, 2020 Combinatorial Optimization Decoder
Intrinsic Reward Driven Imitation Learning via Generative Model Jun 26, 2020 Atari Games Imitation Learning
A General Contextualized Rewriting Framework for Text Summarization Jul 13, 2022 reinforcement-learning Reinforcement Learning (RL)
Inverse Constrained Reinforcement Learning Nov 19, 2020 reinforcement-learning Reinforcement Learning
Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning Nov 1, 2020 Multi-Task Learning reinforcement-learning
Investigating practical linear temporal difference learning Feb 28, 2016 reinforcement-learning Reinforcement Learning
Combining Modular Skills in Multitask Learning Feb 28, 2022 Instruction Following reinforcement-learning
Combinatorial Optimization with Policy Adaptation using Latent Space Search Nov 13, 2023 Benchmarking Combinatorial Optimization
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards Jun 27, 2020 Machine Translation reinforcement-learning
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games Jul 27, 2020 Deep Reinforcement Learning reinforcement-learning
Aspect Sentiment Triplet Extraction Using Reinforcement Learning Aug 13, 2021 Aspect Sentiment Triplet Extraction reinforcement-learning
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning Jul 21, 2023 Benchmarking Combinatorial Optimization
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization Jun 2, 2020 Combinatorial Optimization Deep Reinforcement Learning
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values Oct 4, 2021 Decision Making Deep Reinforcement Learning
Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study Apr 27, 2022 Contact-rich Manipulation Reinforcement Learning (RL)
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments Dec 1, 2022 reinforcement-learning Reinforcement Learning (RL)
A Deep Reinforced Model for Abstractive Summarization May 11, 2017 Abstractive Text Summarization Decoder
Collision Probability Distribution Estimation via Temporal Difference Learning Jul 29, 2024 AI Agent Autonomous Driving
Knowledge-guided Open Attribute Value Extraction with Reinforcement Learning Oct 19, 2020 Articles Attribute
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning Jul 11, 2019 Natural Language Understanding reinforcement-learning