SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1005110100 of 15113 papers

TitleStatusHype
GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement Learning0
Deep Reinforcement Learning with Label Embedding Reward for Supervised Image Hashing0
Hierarchical Reinforcement Learning in StarCraft II with Human Expertise in Subgoals Selection0
TriFinger: An Open-Source Robot for Learning DexterityCode1
Managing caching strategies for stream reasoning with reinforcement learning0
SafePILCO: a software tool for safe and data-efficient policy synthesisCode1
Physics-Based Dexterous Manipulations with Estimated Hand Poses and Residual Reinforcement Learning0
Towards Sample Efficient Agents through Algorithmic AlignmentCode0
Distributed Deep Reinforcement Learning for Functional Split Control in Energy Harvesting Virtualized Small Cells0
Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning0
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning0
Adaptive Coordination Offsets for Signalized Arterial Intersections using Deep Reinforcement Learning0
Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion0
The Emergence of Adversarial Communication in Multi-Agent Reinforcement LearningCode1
A Gentle Lecture Note on Filtrations in Reinforcement Learning0
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents0
Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille KeyboardCode0
Fashion Captioning: Towards Generating Accurate Descriptions with Semantic RewardsCode1
Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without SacrificesCode1
Deep reinforcement learning to detect brain lesions on MRI: a proof-of-concept application of reinforcement learning to medical images0
Mixed-Initiative Level Design with RL BrushCode0
Deep Reinforcement Learning for Field Development Optimization0
Learning Power Control from a Fixed Batch of Data0
Reinforcement Learning-driven Information Seeking: A Quantum Probabilistic Approach0
Optimizing AD Pruning of Sponsored Search with Reinforcement Learning0
Robust Deep Reinforcement Learning through Adversarial LossCode1
Fast Adaptive Task Offloading in Edge Computing based on Meta Reinforcement LearningCode1
Area-wide traffic signal control based on a deep graph Q-Network (DGQN) trained in an asynchronous manner0
Aligning AI With Shared Human ValuesCode2
A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles0
Faded-Experience Trust Region Policy Optimization for Model-Free Power Allocation in Interference Channel0
Explanation of Reinforcement Learning Model in Dynamic Multi-Agent System0
A Relearning Approach to Reinforcement Learning for Control of Smart Buildings0
EasyRL: A Simple and Extensible Reinforcement Learning Framework0
Learning Transition Models with Time-delayed Causal Relations0
Robust Reinforcement Learning using Adversarial PopulationsCode1
Reinforced Epidemic Control: Saving Both Lives and EconomyCode1
Fully Decentralized Reinforcement Learning-based Control of Photovoltaics in Distribution Grids for Joint Provision of Real and Reactive Power0
Learning to Play Two-Player Perfect-Information Games without Knowledge0
Learning Agile Locomotion via Adversarial Training0
Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning0
Cooperative Control of Mobile Robots with Stackelberg Learning0
Proximal Deterministic Policy Gradient0
Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version0
Curriculum Learning with a Progression Function0
Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning0
Neural Batch Sampling with Reinforcement Learning for Semi-Supervised Anomaly Detection0
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs0
Deep Reinforcement Learning Based Mobile Edge Computing for Intelligent Internet of Things0
Show:102550
← PrevPage 202 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified