SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 77267750 of 15113 papers

TitleStatusHype
Model-free Nearly Optimal Control of Constrained-Input Nonlinear Systems Based on Synchronous Reinforcement Learning0
Model-free optimal controller for discrete-time Markovian jump linear systems: A Q-learning approach0
Model-free optimal control of discrete-time systems with additive and multiplicative noises0
Model-Free Optimal Control of Linear Multi-Agent Systems via Decomposition and Hierarchical Approximation0
Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs0
Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents0
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey0
Model-free Reinforcement Learning for Stochastic Stackelberg Security Games0
Model-free Reinforcement Learning for Branching Markov Decision Processes0
Model-Free Reinforcement Learning for Symbolic Automata-encoded Objectives0
Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care0
Model-Free Reinforcement Learning for Asset Allocation0
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation0
Model-Free Reinforcement Learning for Optimal Control of MarkovDecision Processes Under Signal Temporal Logic Specifications0
Model-free Reinforcement Learning for Robust Locomotion using Demonstrations from Trajectory Optimization0
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity0
Model-free Representation Learning and Exploration in Low-rank MDPs0
Model-Free Risk-Sensitive Reinforcement Learning0
Model-Free RL Agents Demonstrate System 1-Like Intentionality0
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees0
Model-Free Unsupervised Learning for Optimization Problems with Constraints0
Model Generation with Provable Coverability for Offline Reinforcement Learning0
Model Imitation for Model-Based Reinforcement Learning0
Modeling Adaptive Platoon and Reservation Based Autonomous Intersection Control: A Deep Reinforcement Learning Approach0
Modeling and Interpreting Real-world Human Risk Decision Making with Inverse Reinforcement Learning0
Show:102550
← PrevPage 310 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified