SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1500115050 of 15113 papers

TitleStatusHype
Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach0
Reinforcement Learning for Matrix Computations: PageRank as an Example0
Reinforcement Learning Framework for Opportunistic Routing in WSNs0
Distributed Reinforcement Learning via Gossip0
Sample Complexity of Multi-task Reinforcement Learning0
Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb0
The Sample-Complexity of General Reinforcement Learning0
Coevolutionary networks of reinforcement-learning agents0
Evaluating State Representations for Reinforcement Learning of Turn-Taking Policies in Tutorial Dialogue0
Generating Student Feedback from Time-Series Data Using Reinforcement Learning0
Reinforcement Learning of Two-Issue Negotiation Dialogue Policies0
Sequential Transfer in Multi-armed Bandit with Finite Set of Models0
Model-Based Policy Gradients with Parameter-Based Exploration by Least-Squares Conditional Density Estimation0
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization0
Probabilistic inverse reinforcement learning in unknown environments0
Multi-Task Policy Search0
Scaling Up Robust MDPs by Reinforcement Learning0
Reinforcement learning with restrictions on the action set0
The association problem in wireless networks: a Policy Gradient Reinforcement Learning approach0
Direct Uncertainty Estimation in Reinforcement Learning0
(More) Efficient Reinforcement Learning via Posterior Sampling0
Reinforcement Learning for the Soccer Dribbling Task0
Cover Tree Bayesian Reinforcement Learning0
Regret Bounds for Reinforcement Learning with Policy Advice0
Non Deterministic Logic Programs0
A General Framework for Interacting Bayes-Optimally with Self-Interested Agents using Arbitrary Parametric Model and Model Prior0
Model-based Bayesian Reinforcement Learning for Dialogue Management0
Design for a Darwinian Brain: Part 2. Cognitive Architecture0
ABC Reinforcement Learning0
Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems0
A Greedy Approximation of Bayesian Reinforcement Learning with Probably Optimistic Transition Model0
Toggling a Genetic Switch Using Reinforcement Learning0
Hybrid Q-Learning Applied to Ubiquitous recommender system0
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning0
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning0
Reinforcement learning for port-Hamiltonian systems0
Weighted Likelihood Policy Search with Model Selection0
Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions0
Transferring Expectations in Model-based Reinforcement Learning0
Value Pursuit Iteration0
On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization0
Neurally Plausible Reinforcement Learning of Working Memory Tasks0
Sketch-Based Linear Value Function Approximation0
Algorithms for Learning Markov Field Policies0
Bayesian Hierarchical Reinforcement Learning0
Inverse Reinforcement Learning through Structured Classification0
Exploration in Model-based Reinforcement Learning by Empirically Estimating Learning Progress0
Cost-Sensitive Exploration in Bayesian Reinforcement Learning0
TACT: A Transfer Actor-Critic Learning Framework for Energy Saving in Cellular Radio Access Networks0
Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics0
Show:102550
← PrevPage 301 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified