Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic Jun 5, 2023 continuous-control Continuous Control
Select and Trade: Towards Unified Pair Trading with Hierarchical Reinforcement Learning Jan 25, 2023 Hierarchical Reinforcement Learning PAIR TRADING
Self-critical Sequence Training for Image Captioning Dec 2, 2016 Image Captioning Policy Gradient Methods
Self-Driving Network and Service Coordination Using Deep Reinforcement Learning Nov 2, 2020 Deep Reinforcement Learning reinforcement-learning
Self-Paced Deep Reinforcement Learning Apr 24, 2020 Deep Reinforcement Learning Open-Ended Question Answering
Self-Play Reinforcement Learning for Fast Image Retargeting Oct 2, 2020 Image Retargeting reinforcement-learning
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning Oct 6, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Self-supervised Visual Reinforcement Learning with Object-centric Representations Nov 29, 2020 Object reinforcement-learning
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning Aug 31, 2017 reinforcement-learning Reinforcement Learning
Sequential Voting with Relational Box Fields for Active Object Detection Oct 21, 2021 Active Object Detection Decision Making
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning May 23, 2023 continuous-control Continuous Control
Settling the Variance of Multi-Agent Policy Gradients Aug 19, 2021 MuJoCo Reinforcement Learning (RL)
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning Sep 4, 2020 Bayesian Optimization reinforcement-learning
SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments Jan 31, 2025 reinforcement-learning Reinforcement Learning
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning Mar 7, 2023 Navigate reinforcement-learning
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks Jul 13, 2021 continuous-control Continuous Control
Compositional Reinforcement Learning from Logical Specifications Jun 25, 2021 reinforcement-learning Reinforcement Learning
Simple random search provides a competitive approach to reinforcement learning Mar 19, 2018 Computational Efficiency continuous-control
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience Aug 9, 2022 reinforcement-learning Reinforcement Learning
Addressing Function Approximation Error in Actor-Critic Methods Feb 26, 2018 Continuous Control OpenAI Gym
CompoSuite: A Compositional Reinforcement Learning Benchmark Jul 8, 2022 reinforcement-learning Reinforcement Learning
Compiler Optimization for Quantum Computing Using Reinforcement Learning Dec 8, 2022 Compiler Optimization reinforcement-learning
Simulating SQL Injection Vulnerability Exploitation Using Q-Learning Reinforcement Learning Agents Jan 8, 2021 Q-Learning reinforcement-learning
Batch Exploration with Examples for Scalable Robotic Reinforcement Learning Oct 22, 2020 Offline RL reinforcement-learning
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulations Sep 17, 2020 Evolutionary Algorithms Reinforcement Learning (RL)
Compile Scene Graphs with Reinforcement Learning Apr 18, 2025 reinforcement-learning Reinforcement Learning
Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions Jun 9, 2025 Reinforcement Learning (RL)
Skill-Based Reinforcement Learning with Intrinsic Reward Matching Oct 14, 2022 reinforcement-learning Reinforcement Learning
Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint Generators Apr 8, 2021 Deep Reinforcement Learning reinforcement-learning
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning May 9, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching Feb 4, 2022 Imitation Learning Reinforcement Learning (RL)
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments Jun 17, 2022 Benchmarking Deep Reinforcement Learning
Constrained Update Projection Approach to Safe Policy Optimization Sep 15, 2022 Reinforcement Learning (RL) Safe Reinforcement Learning
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Jul 29, 2022 Contrastive Learning Deep Reinforcement Learning
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control Feb 16, 2022 Deep Reinforcement Learning reinforcement-learning
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop Jul 20, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Soft Hindsight Experience Replay Feb 6, 2020 Deep Reinforcement Learning reinforcement-learning
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning Nov 4, 2018 Decoder Multi-agent Reinforcement Learning
Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning Sep 9, 2020 reinforcement-learning Reinforcement Learning
Solving Compositional Reinforcement Learning Problems via Task Reduction Mar 13, 2021 continuous-control Continuous Control
Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings Nov 25, 2020 Deep Reinforcement Learning Model-based Reinforcement Learning
Solving the Traveling Salesperson Problem with Precedence Constraints by Deep Reinforcement Learning Jul 4, 2022 Deep Reinforcement Learning reinforcement-learning
Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMs Feb 23, 2023 reinforcement-learning Reinforcement Learning (RL)
CommonPower: A Framework for Safe Data-Driven Smart Grid Control Jun 5, 2024 Benchmarking energy management
Spatial-temporal recurrent reinforcement learning for autonomous ships Nov 2, 2022 Deep Reinforcement Learning reinforcement-learning
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective May 11, 2021 Deep Reinforcement Learning reinforcement-learning
Bayesian Generational Population-Based Training Jul 19, 2022 Bayesian Optimization Reinforcement Learning (RL)
SplAgger: Split Aggregation for Meta-Reinforcement Learning Mar 5, 2024 continuous-control Continuous Control
Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem Dec 8, 2020 Combinatorial Optimization Q-Learning
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization Jun 2, 2020 Combinatorial Optimization Deep Reinforcement Learning