SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal of reinforcement learning is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
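The interaction loop described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and does not come from any listed paper: a hypothetical five-state corridor environment (`step`), tabular Q-learning as the learning rule, uniformly random exploration, and the hyperparameters `alpha`, `gamma`, `episodes`, and `seed`.

```python
import random

N_STATES = 5          # hypothetical corridor: states 0..4, state 4 is terminal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (an assumption): returns (next_state, reward, done)."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    # The only reward is 1.0, given on reaching the terminal state.
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning with a uniformly random behavior policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = rng.choice(ACTIONS)              # explore at random
            next_state, reward, done = step(state, action)
            # Bootstrapped target: reward plus discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Highest-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
print(greedy_policy(q_table))
```

Q-learning is off-policy, so the agent can behave randomly while still estimating the values of the greedy policy; in this toy corridor the learned greedy policy should prefer moving right, toward the reward, in every state.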

Papers

Showing 3426-3450 of 15113 papers

Title | Status | Hype
Reinforcement Learning for Topic Models | Code | 0
Truncating Trajectories in Monte Carlo Reinforcement Learning | - | 0
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization | - | 0
Explaining RL Decisions with Trajectories | Code | 0
How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 2: Method and Applications | - | 0
Rethinking Population-assisted Off-policy Reinforcement Learning | - | 0
Federated Ensemble-Directed Offline Reinforcement Learning | Code | 1
Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy | Code | 0
How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory | - | 0
Simple Noisy Environment Augmentation for Reinforcement Learning | Code | 0
Explainable Reinforcement Learning via a Causal World Model | Code | 1
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems | Code | 0
Gym-preCICE: Reinforcement Learning Environments for Active Flow Control | - | 0
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare | Code | 1
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees | Code | 0
Validation of massively-parallel adaptive testing using dynamic control matching | - | 0
An Autonomous Non-monolithic Agent with Multi-mode Exploration based on Options Framework | Code | 0
Online Portfolio Management via Deep Reinforcement Learning with High-Frequency Data | Code | 1
A Transfer Learning Approach to Minimize Reinforcement Learning Risks in Energy Optimization for Smart Buildings | - | 0
Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning | - | 0
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation | Code | 1
A Federated Reinforcement Learning Framework for Link Activation in Multi-link Wi-Fi Networks | - | 0
One-Step Distributional Reinforcement Learning | - | 0
Multi-criteria Hardware Trojan Detection: A Reinforcement Learning Approach | - | 0
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified