SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 37513775 of 15113 papers

TitleStatusHype
Deep Decentralized Reinforcement Learning for Cooperative Control0
Discovering Options for Exploration by Minimizing Cover Time0
Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted Vehicular Networks0
Auxiliary Reward Generation with Transition Distance Representation Learning0
A State Aggregation Approach for Solving Knapsack Problem with Deep Reinforcement Learning0
Accelerating Stochastic Composition Optimization0
Deep Reinforcement Learning for Visual Object Tracking in Videos0
Deep Reinforcement Learning for Weapons to Targets Assignment in a Hypersonic strike0
Corruption-Robust Offline Reinforcement Learning0
Corruption-robust exploration in episodic reinforcement learning0
A stabilizing reinforcement learning approach for sampled systems with partially unknown models0
Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations0
Automated Video Game Testing Using Synthetic and Human-Like Agents0
A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted Setting0
Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning0
Deep Reinforcement Learning from Policy-Dependent Human Feedback0
Deep Reinforcement Learning From Raw Pixels in Doom0
Discrete Predictive Representation for Long-horizon Planning0
Deep reinforcement learning guided graph neural networks for brain network analysis0
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes0
Deep Reinforcement Learning in a Monetary Model0
Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey0
Average Cost Optimal Control of Stochastic Systems Using Reinforcement Learning0
Deep Reinforcement Learning in Cryptocurrency Market Making0
Correlation Priors for Reinforcement Learning0
Show:102550
← PrevPage 151 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified