SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 39513975 of 15113 papers

TitleStatusHype
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary PredictionCode0
Causal Reasoning from Meta-reinforcement LearningCode0
Few-shot Quality-Diversity OptimizationCode0
Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement LearningCode0
FFNet: Video Fast-Forwarding via Reinforcement LearningCode0
FeUdal Networks for Hierarchical Reinforcement LearningCode0
Feudal Graph Reinforcement LearningCode0
Financial Trading as a Game: A Deep Reinforcement Learning ApproachCode0
Feature-Attending Recurrent Modules for Generalization in Reinforcement LearningCode0
A Novel Approach to Curiosity and Explainable Reinforcement Learning via Interpretable Sub-GoalsCode0
FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent SystemsCode0
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement LearningCode0
Fast Rates for Maximum Entropy ExplorationCode0
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection ApproachCode0
Faster Reinforcement Learning Using Active SimulatorsCode0
Self-Learning Exploration and Mapping for Mobile Robots via Deep Reinforcement LearningCode0
Action-depedent Control Variates for Policy Optimization via Stein's IdentityCode0
Federated Control with Hierarchical Multi-Agent Deep Reinforcement LearningCode0
Flexible Option LearningCode0
GAC: A Deep Reinforcement Learning Model Toward User Incentivization in Unknown Social NetworksCode0
FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning AgentsCode0
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue SystemsCode0
MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement LearningCode0
Semifactual Explanations for Reinforcement LearningCode0
Causal Campbell-Goodhart's law and Reinforcement LearningCode0
Show:102550
← PrevPage 159 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified