SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
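
As a rough illustration of the agent-environment loop described above, the following is a minimal sketch using tabular Q-learning on a made-up five-state corridor. The environment, the choice of Q-learning, the discount factor `gamma`, and all hyperparameters are assumptions chosen for illustration. None of them come from the papers or benchmarks listed on this page.

```python
import random

# Hypothetical toy environment: a corridor of 5 states (0..4).
# The agent starts in state 0 and receives reward +1 on reaching state 4.
N_STATES = 5
ACTIONS = (-1, +1)  # index 0 = move left, index 1 = move right


def step(state, action_index):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def discounted_return(rewards, gamma=0.9):
    """Cumulative reward: sum over t of gamma**t * rewards[t]."""
    total = 0.0
    for r in reversed(rewards):
        total = r + gamma * total
    return total


def train_q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a table of action values Q[state][action] by trial and error."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.randrange(len(ACTIONS))
            else:
                # Exploit: pick a best-valued action, breaking ties at random.
                best = max(q[state])
                action = rng.choice(
                    [i for i, v in enumerate(q[state]) if v == best]
                )
            next_state, reward, done = step(state, action)
            # Feedback from the environment updates the value estimate.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action index for each non-terminal state."""
    return [
        max(range(len(ACTIONS)), key=lambda i: q[s][i])
        for s in range(N_STATES - 1)
    ]


if __name__ == "__main__":
    q_table = train_q_learning()
    print("Greedy policy (1 = move right):", greedy_policy(q_table))
    print("Return of the shortest path:", discounted_return([0, 0, 0, 1]))
```

In this sketch the policy is simply "take the highest-valued action in each state". Discounting future rewards with `gamma` is one common way to define long-term reward, and it makes shorter paths to the goal more valuable than longer ones.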

Papers

Showing 4701-4725 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| An Invitation to Deep Reinforcement Learning | | 0 |
| An L^2 Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation | | 0 |
| An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models | | 0 |
| Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning | | 0 |
| An ocular biomechanics environment for reinforcement learning | | 0 |
| An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning | | 0 |
| An Offline Deep Reinforcement Learning for Maintenance Decision-Making | | 0 |
| An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems | | 0 |
| Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning | | 0 |
| Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning | | 0 |
| A non-cooperative meta-modeling game for automated third-party calibrating, validating, and falsifying constitutive laws with parallelized adversarial attacks | | 0 |
| An online evolving framework for advancing reinforcement-learning based automated vehicle control | | 0 |
| An Online Model-Following Projection Mechanism Using Reinforcement Learning | | 0 |
| An Online Prediction Algorithm for Reinforcement Learning with Linear Function Approximation using Cross Entropy Method | | 0 |
| An open source Multi-Agent Deep Reinforcement Learning Routing Simulator for satellite networks | | 0 |
| An Optics Controlling Environment and Reinforcement Learning Benchmarks | | 0 |
| An Optimal Control View of Adversarial Machine Learning | | 0 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | | 0 |
| An Optimization Framework for Task Sequencing in Curriculum Learning | | 0 |
| An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems | | 0 |
| An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes | | 0 |
| An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment | | 0 |
| Reinforcement Learning with Wasserstein Distance Regularisation, with Applications to Multipolicy Learning | | 0 |
| A novel agent with formal goal-reaching guarantees: an experimental study with a mobile robot | | 0 |
| A novel approach for multi-agent cooperative pursuit to capture grouped evaders | | 0 |
Page 189 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |