SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 84518475 of 15113 papers

TitleStatusHype
Optimizing a domestic battery and solar photovoltaic system with deep reinforcement learning0
Multi-agent deep reinforcement learning (MADRL) meets multi-user MIMO systems0
Projected State-action Balancing Weights for Offline Reinforcement Learning0
User Tampering in Reinforcement Learning Recommender Systems0
OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution MatchingCode0
Self-supervised Reinforcement Learning with Independently Controllable Subgoals0
Deep Reinforcement Learning for Equal Risk Pricing and Hedging under Dynamic Expectile Risk Measures0
Incentivizing an Unknown Crowd0
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions0
A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with an Arbitrary Opponent0
Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning0
Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning0
A Deep Reinforcement Learning Approach for Online Parcel Assignment0
Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning0
Safety-Critical Learning of Robot Control with Temporal Logic Specifications0
Robust Predictable Control0
On the impact of MDP design for Reinforcement Learning agents in Resource Management0
CyGIL: A Cyber Gym for Training Autonomous Agents over Emulated Network Systems0
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning0
Delving into Macro Placement with Reinforcement Learning0
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented GuesserCode0
Guiding Global Placement With Reinforcement Learning0
Deep SIMBAD: Active Landmark-based Self-localization Using Ranking -based Scene Descriptor0
Method for making multi-attribute decisions in wargames by combining intuitionistic fuzzy numbers with reinforcement learning0
Temporal Shift Reinforcement LearningCode0
Show:102550
← PrevPage 339 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified