SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 50515075 of 15113 papers

TitleStatusHype
How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory0
Simple Noisy Environment Augmentation for Reinforcement LearningCode0
Rethinking Population-assisted Off-policy Reinforcement Learning0
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender SystemsCode0
Gym-preCICE: Reinforcement Learning Environments for Active Flow Control0
An Autonomous Non-monolithic Agent with Multi-mode Exploration based on Options FrameworkCode0
Validation of massively-parallel adaptive testing using dynamic control matching0
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality GuaranteesCode0
A Transfer Learning Approach to Minimize Reinforcement Learning Risks in Energy Optimization for Smart Buildings0
Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning0
A Federated Reinforcement Learning Framework for Link Activation in Multi-link Wi-Fi Networks0
One-Step Distributional Reinforcement Learning0
Reinforcement Learning with Partial Parametric Model Knowledge0
Multi-criteria Hardware Trojan Detection: A Reinforcement Learning Approach0
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation ProcessingCode0
Distance Weighted Supervised Learning for Offline Interaction DataCode0
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution TrajectoriesCode0
A Closer Look at Reward Decomposition for High-Level Robotic Explanations0
Loss- and Reward-Weighting for Efficient Distributed Reinforcement Learning0
Proximal Curriculum for Reinforcement Learning AgentsCode0
Model Extraction Attacks Against Reinforcement Learning Based Controllers0
What can online reinforcement learning with function approximation benefit from general coverage conditions?0
On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes0
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey0
Policy Resilience to Environment Poisoning Attacks on Reinforcement Learning0
Show:102550
← PrevPage 203 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified