SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1115111200 of 15113 papers

TitleStatusHype
Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?0
Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning0
Counterfactual Multi-Agent Reinforcement Learning with Graph Convolution Communication0
Learning Sparse Rewarded Tasks from Sub-Optimal DemonstrationsCode0
Constrained-Space Optimization and Reinforcement Learning for Complex Tasks0
Controlling Rayleigh-Bénard convection via Reinforcement Learning0
Learning to Ask Medical Questions using Reinforcement LearningCode0
Leverage the Average: an Analysis of KL Regularization in RL0
Augmented Q Imitation Learning (AQIL)Code0
Exploration in Action SpaceCode0
Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual DataCode0
Mimicking Evolution with Reinforcement Learning0
Robotic Table Tennis with Model-Free Reinforcement Learning0
Optimal Bidding Strategy without Exploration in Real-time Bidding0
Optimising Lockdown Policies for Epidemic Control using Reinforcement LearningCode0
Suphx: Mastering Mahjong with Deep Reinforcement LearningCode0
Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles with Uncertainties0
Parallel Knowledge Transfer in Multi-Agent Reinforcement Learning0
When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey0
Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward ShapingCode0
Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement LearningCode0
Learning medical triage from clinicians using Deep Q-Learning0
AirRL: A Reinforcement Learning Approach to Urban Air Quality Inference0
Adaptive Reward-Poisoning Attacks against Reinforcement Learning0
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms0
Towards Better Opioid Antagonists Using Deep Reinforcement Learning0
ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing0
Learning to Play Soccer by Reinforcement and Applying Sim-to-Real to Compete in the Real World0
Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning0
Finite-Time Analysis of Stochastic Gradient Descent under Markov Randomness0
Distributional Reinforcement Learning with Ensembles0
Learning Compact Reward for Image Captioning0
Driver Modeling through Deep Reinforcement Learning and Behavioral Game Theory0
Q-Learning in Regularized Mean-field Games0
Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward0
Incorporating Relational Background Knowledge into Reinforcement Learning via Differentiable Inductive Logic Programming0
Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari0
Learning to Walk: Spike Based Reinforcement Learning for Hexapod Robot Central Pattern Generation0
Reinforcement Learning in Economics and Finance0
Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach0
Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics0
Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation0
Deep Reinforcement Learning with Robust and Smooth Policy0
Deep Sets for Generalization in RL0
Deep Reinforcement Learning with Weighted Q-Learning0
Deep Constrained Q-learning0
Safe Reinforcement Learning of Control-Affine Systems with Vertex NetworksCode0
Towards Cognitive Routing based on Deep Reinforcement Learning0
Reinforcement learning enabled cooperative spectrum sensing in cognitive radio networks0
Exchangeable Input Representations for Reinforcement Learning0
Show:102550
← PrevPage 224 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified