SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 54015425 of 15113 papers

TitleStatusHype
FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks0
Defending Observation Attacks in Deep Reinforcement Learning via Detection and DenoisingCode0
Deep Reinforcement Learning for Exact Combinatorial Optimization: Learning to Branch0
Transformers are Meta-Reinforcement LearnersCode1
RoSGAS: Adaptive Social Bot Detection with Reinforced Self-Supervised GNN Architecture SearchCode1
Universally Expressive Communication in Multi-Agent Reinforcement LearningCode0
Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT0
Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization0
Open-Ended Learning Strategies for Learning Complex Locomotion Skills0
Visual Radial Basis Q-Network0
Robust Reinforcement Learning with Distributional Risk-averse formulation0
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning0
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward0
Reinforcement Learning-based Placement of Charging Stations in Urban Road NetworksCode1
Provable Benefit of Multitask Representation Learning in Reinforcement Learning0
IGN : Implicit Generative NetworksCode0
Computation Offloading and Resource Allocation in F-RANs: A Federated Deep Reinforcement Learning Approach0
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks0
Intrinsically motivated option learning: a comparative study of recent methods0
Relative Policy-Transition Optimization for Fast Policy Transfer0
RL-GA: A Reinforcement Learning-Based Genetic Algorithm for Electromagnetic Detection Satellite Scheduling Problem0
Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning0
Case-Based Inverse Reinforcement Learning Using Temporal CoherenceCode0
Deep Reinforcement Learning for Optimal Investment and Saving Strategy Selection in Heterogeneous Profiles: Intelligent Agents working towards retirement0
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum GamesCode1
Show:102550
← PrevPage 217 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified