SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 92269250 of 15113 papers

TitleStatusHype
Demystify Painting with RL0
Deploying Reinforcement Learning in Water Transport0
Automatic Source Code Summarization via Reinforcement Learning0
Learning Mobile Robot Navigation in the Dense Crowd with Deep Reinforcement Learning0
A case for new neural network smoothness constraints0
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL0
Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation0
SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning0
Policy Gradient RL Algorithms as Directed Acyclic GraphsCode1
Sim-to-real reinforcement learning applied to end-to-end vehicle controlCode1
Specializing Inter-Agent Communication in Heterogeneous Multi-Agent Reinforcement Learning using Agent Class Information0
Learning for MPC with Stability & Safety Guarantees0
Active Hierarchical Imitation and Reinforcement Learning0
A Reinforcement Learning Formulation of the Lyapunov Optimization: Application to Edge Computing Systems with Queue Stability0
Evolutionary learning of interpretable decision treesCode0
Reinforcement Learning with Subspaces using Free Energy Paradigm0
Tutoring Reinforcement Learning via Feedback Control0
Semi-supervised reward learning for offline reinforcement learning0
Noise-Robust End-to-End Quantum Control using Deep Autoregressive Policy Networks0
Regularizing Action Policies for Smooth Control with Reinforcement Learning0
OPAC: Opportunistic Actor-Critic0
Performance-Weighed Policy Sampling for Meta-Reinforcement Learning0
Blending MPC & Value Function Approximation for Efficient Reinforcement Learning0
Flatland-RL : Multi-Agent Reinforcement Learning on Trains0
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy SearchCode1
Show:102550
← PrevPage 370 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified