SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal of reinforcement learning is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
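The interaction loop described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the five-cell corridor environment, the names `step`, `train`, and `greedy_policy`, and all hyperparameter values are hypothetical and do not come from any paper listed on this page. Tabular Q-learning is used here as one common way to learn a policy from reward feedback, not as the only one.

```python
import random

# Hypothetical toy environment (an assumption, not from the source):
# a 1-D corridor with positions 0..4. The agent starts at 0 and
# receives reward 1 when it reaches position 4, which ends the episode.
N_STATES = 5
ACTIONS = (-1, +1)  # step left, step right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning: estimate action values from reward feedback."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: mostly exploit, sometimes explore.
            # Ties between the two actions are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Move Q(s, a) toward reward + discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action in every non-terminal state."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the learned greedy policy moves right in every non-terminal state, which is the strategy that maximizes the discounted long-term reward. The discount factor `gamma` controls how strongly future rewards count relative to immediate ones.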

Papers

Showing 11351–11400 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Exploiting the potential of deep reinforcement learning for classification tasks in high-dimensional and unstructured data | | 0 |
| Teaching robots to perceive time -- A reinforcement learning approach (Extended version) | | 0 |
| Mastering Complex Control in MOBA Games with Deep Reinforcement Learning | | 0 |
| Optimizing Collision Avoidance in Dense Airspace using Deep Reinforcement Learning | | 0 |
| Deep Reinforcement Learning for Smart Home Energy Management | | 0 |
| Extendable NFV-Integrated Control Method Using Reinforcement Learning | | 0 |
| Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach | Code | 0 |
| Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents' Capabilities and Limitations | Code | 0 |
| Deep Reinforcement Learning for Motion Planning of Mobile Robots | | 0 |
| Deep Reinforcement Learning Designed Shinnar-Le Roux RF Pulse using Root-Flipping: DeepRF_SLR | | 0 |
| Benchmarking the Neural Linear Model for Regression | | 0 |
| Learning to grow: control of material self-assembly using evolutionary reinforcement learning | | 0 |
| Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation | | 0 |
| Distributional Reinforcement Learning for Energy-Based Sequential Models | Code | 0 |
| Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning | | 0 |
| Unpaired Image Enhancement Featuring Reinforcement-Learning-Controlled Image Editing Software | | 0 |
| MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning | Code | 0 |
| KARL: Knowledge-Aware Reasoning Memory Modeling with Reinforcement Learning of Vector Space | | 0 |
| Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning | | 0 |
| Planning with Abstract Learned Models While Learning Transferable Subtasks | | 0 |
| PixelRL: Fully Convolutional Network with Reinforcement Learning for Image Processing | Code | 0 |
| UNAS: Differentiable Architecture Search Meets Reinforcement Learning | Code | 0 |
| Pseudo Random Number Generation: a Reinforcement Learning approach | Code | 1 |
| Fairness in Multi-agent Reinforcement Learning for Stock Trading | | 0 |
| Bayesian Linear Regression on Deep Representations | | 0 |
| Spatial Influence-aware Reinforcement Learning for Intelligent Transportation System | | 0 |
| Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator | | 0 |
| Resolving Congestions in the Air Traffic Management Domain via Multiagent Reinforcement Learning Methods | | 0 |
| More Efficient Off-Policy Evaluation through Regularized Targeted Learning | | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | | 0 |
| Provably Efficient Reinforcement Learning with Aggregated States | | 0 |
| Dota 2 with Large Scale Deep Reinforcement Learning | Code | 0 |
| Lessons from reinforcement learning for biological representations of space | | 0 |
| Improved Activity Forecasting for Generating Trajectories | | 0 |
| Learning to Reach Goals via Iterated Supervised Learning | Code | 0 |
| Control-Tutored Reinforcement Learning | | 0 |
| The PlayStation Reinforcement Learning Environment (PSXLE) | Code | 0 |
| Provably Efficient Exploration in Policy Optimization | | 0 |
| Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model | | 0 |
| Quality of syntactic implication of RL-based sentence summarization | | 0 |
| SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments | Code | 0 |
| Online Deep Reinforcement Learning for Autonomous UAV Navigation and Exploration of Outdoor Environments | | 0 |
| Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning | | 0 |
| Biases for Emergent Communication in Multi-agent Reinforcement Learning | | 0 |
| Energy-aware Scheduling of Jobs in Heterogeneous Cluster Systems Using Deep Reinforcement Learning | | 0 |
| Efficient Robotic Task Generalization Using Deep Model Fusion Reinforcement Learning | | 0 |
| Imitation Learning via Off-Policy Distribution Matching | Code | 1 |
| AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos | | 0 |
| Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion | | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |