SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 51765200 of 15113 papers

TitleStatusHype
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning0
Remote Electrical Tilt Optimization via Safe Reinforcement Learning0
Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients0
Rendering-Aware Reinforcement Learning for Vector Graphics Generation0
Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning0
Renewal Monte Carlo: Renewal theory based reinforcement learning0
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning0
REPAINT: Knowledge Transfer in Deep Actor-Critic Reinforcement Learning0
REPAINT: Knowledge Transfer in Deep Reinforcement Learning0
Reparameterized Policy Learning for Multimodal Trajectory Optimization0
Repeated Inverse Reinforcement Learning0
Replay across Experiments: A Natural Extension of Off-Policy RL0
Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning0
Replicability in Reinforcement Learning0
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization0
REPNP: Plug-and-Play with Deep Reinforcement Learning Prior for Robust Image Restoration0
RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection0
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning0
Representational efficiency outweighs action efficiency in human program induction0
Representation and Invariance in Reinforcement Learning0
Representation and Reinforcement Learning for Personalized Glycemic Control in Septic Patients0
Representation Balancing Offline Model-based Reinforcement Learning0
Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model0
Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning0
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning0
Show:102550
← PrevPage 208 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified