SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1077610800 of 15113 papers

TitleStatusHype
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation0
Towards Tractable Optimism in Model-Based Reinforcement Learning0
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees0
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies0
Entropic Risk Constrained Soft-Robust Policy Optimization0
Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms0
Learn to Earn: Enabling Coordination within a Ride Hailing Fleet0
A Reinforcement Learning Approach for Transient Control of Liquid Rocket Engines0
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration0
FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimize0
On Reward-Free Reinforcement Learning with Linear Function Approximation0
Provably adaptive reinforcement learning in metric spaces0
WD3: Taming the Estimation Bias in Deep Reinforcement Learning0
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs0
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement LearningCode0
Cooperative Multi-Agent Reinforcement Learning with Partial Observations0
Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning0
Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning0
Deep Reinforcement Learning amidst Lifelong Non-Stationarity0
Deep Reinforcement Learning Controller for 3D Path-following and Collision Avoidance by Autonomous Underwater Vehicles0
Delta Schema Network in Model-based Reinforcement LearningCode0
Eco-Vehicular Edge Networks for Connected Transportation: A Distributed Multi-Agent Reinforcement Learning Approach0
Introduction to Machine Learning for Accelerator Physics0
Green Simulation Assisted Reinforcement Learning with Model Risk for Biomanufacturing Learning and ControlCode0
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework0
Show:102550
← PrevPage 432 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified