SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1132611350 of 15113 papers

TitleStatusHype
UAV Aided Search and Rescue Operation Using Reinforcement Learning0
Value-driven Hindsight Modelling0
Optimistic Policy Optimization with Bandit Feedback0
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning0
Efficient Deep Reinforcement Learning via Adaptive Policy TransferCode0
Curriculum in Gradient-Based Meta-Reinforcement Learning0
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge0
Empirical Policy Evaluation with Supergraphs0
Adaptive Estimator Selection for Off-Policy EvaluationCode0
Multi-Issue Bargaining With Deep Reinforcement Learning0
MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding0
Reinforcement learning for the privacy preservation and manipulation of eye tracking data0
Reward Design for Driver Repositioning Using Multi-Agent Reinforcement Learning0
Langevin DQNCode0
Control Frequency Adaptation via Action Persistence in Batch Reinforcement LearningCode0
Adaptive Experience Selection for Policy Gradient0
Investigating Simple Object Representations in Model-Free Deep Reinforcement Learning0
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling0
The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI0
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement LearningCode0
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning0
Robust Reinforcement Learning via Adversarial training with Langevin DynamicsCode0
Extended Markov Games to Learn Multiple Tasks in Multi-Agent Reinforcement LearningCode0
Deep Reinforcement Learning-Based Beam Tracking for Low-Latency Services in Vehicular Networks0
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic0
Show:102550
← PrevPage 454 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified