SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 47764800 of 15113 papers

TitleStatusHype
Applications of Reinforcement Learning in Deregulated Power Market: A Comprehensive Review0
Applications of Reinforcement Learning in Finance -- Trading with a Double Deep Q-Network0
Appraisal-Guided Proximal Policy Optimization: Modeling Psychological Disorders in Dynamic Grid World0
Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods0
Approximate discounting-free policy evaluation from transient and recurrent states0
Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning0
Approximate Dynamic Programming For Linear Systems with State and Input Constraints0
Approximate Equivariance in Reinforcement Learning0
A policy gradient approach for optimization of smooth risk measures0
Approximate information state based convergence analysis of recurrent Q-learning0
Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning0
Approximately Solving Mean Field Games via Entropy-Regularized Deep Reinforcement Learning0
Approximated Multi-Agent Fitted Q Iteration0
Does DQN Learn?0
Approximate Robust NMPC using Reinforcement Learning0
Approximating a deep reinforcement learning docking agent using linear model trees0
Approximating Energy Market Clearing and Bidding With Model-Based Reinforcement Learning0
Approximating Euclidean by Imprecise Markov Decision Processes0
Approximating Martingale Process for Variance Reduction in Deep Reinforcement Learning with Large State Space0
Approximating meta-heuristics with homotopic recurrent neural networks0
Approximating Pareto Frontier through Bayesian-optimization-directed Robust Multi-objective Reinforcement Learning0
A Practical Adversarial Attack on Contingency Detection of Smart Energy Systems0
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning0
APRIL: Active Preference-learning based Reinforcement Learning0
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee0
Show:102550
← PrevPage 192 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified