SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 50515075 of 15113 papers

TitleStatusHype
Plug-and-Play Model-Agnostic Counterfactual Policy Synthesis for Deep Reinforcement Learning based Recommendation0
Robust Reinforcement Learning using Offline DataCode1
Vehicle Type Specific Waypoint Generation0
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-20
Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems0
Object Detection with Deep Reinforcement LearningCode1
On the Importance of Critical Period in Multi-stage Reinforcement Learning0
Automating DBSCAN via Deep Reinforcement LearningCode1
Exploring the trade off between human driving imitation and safety for traffic simulation0
Generalized Reinforcement Learning: Experience Particles, Action Operator, Reinforcement Field, Memory Association, and Decision Concepts0
Intrinsically Motivated Learning of Causal World Models0
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past ExperienceCode1
Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning0
From Scratch to Sketch: Deep Decoupled Hierarchical Reinforcement Learning for Robotic Sketching AgentCode1
Continual Reinforcement Learning with TELLA0
Learning-Based Client Selection for Federated Learning Services Over Wireless Networks with Constrained Monetary Budgets0
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning0
A Game-Theoretic Perspective of Generalization in Reinforcement Learning0
Multi-agent reinforcement learning for intent-based service assurance in cellular networks0
Socially Intelligent Genetic Agents for the Emergence of Explicit NormsCode0
Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems0
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement LearningCode2
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment0
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts0
DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV0
Show:102550
← PrevPage 203 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified