SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 676700 of 15113 papers

TitleStatusHype
Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply ChainsCode1
Asynchronous Methods for Deep Reinforcement LearningCode1
Adversarial Search and Tracking with Multiagent Reinforcement Learning in Sparsely Observable EnvironmentCode1
Deep Reinforcement Learning for Conservation DecisionsCode1
Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement LearningCode1
Deep Reinforcement Learning for List-wise RecommendationsCode1
Adversarial Soft Advantage Fitting: Imitation Learning without Policy OptimizationCode1
Attacking Video Recognition Models with Bullet-Screen CommentsCode1
Deep Reinforcement Learning for Joint Spectrum and Power Allocation in Cellular NetworksCode1
A Traffic Light Dynamic Control Algorithm with Deep Reinforcement Learning Based on GNN PredictionCode1
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority InfluenceCode1
Deep Reinforcement Learning for Resource Allocation in Business ProcessesCode1
Augmenting Policy Learning with Routines Discovered from a Single DemonstrationCode1
Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline GenerationCode1
Deep reinforcement learning-designed radiofrequency waveform in MRICode1
Active Reinforcement Learning for Robust Building ControlCode1
Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation TasksCode1
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum GamesCode1
Aerial View Localization with Reinforcement Learning: Towards Emulating Search-and-RescueCode1
Accelerating lifelong reinforcement learning via reshaping rewardsCode1
Adversarial Policies: Attacking Deep Reinforcement LearningCode1
Deep Reinforcement Learning in Parameterized Action SpaceCode1
Deep Reinforcement Learning Control of Quantum CartpolesCode1
A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learningCode1
Deep Reinforcement Learning based Group Recommender SystemCode1
Show:102550
← PrevPage 28 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified