SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1120111225 of 15113 papers

TitleStatusHype
Learning to Fly via Deep Model-Based Reinforcement LearningCode0
Adjust Planning Strategies to Accommodate Reinforcement Learning AgentsCode0
Generating Socially Acceptable Perturbations for Efficient Evaluation of Autonomous Vehicles0
Viewport-Aware Deep Reinforcement Learning Approach for 360^o Video Caching0
Placement Optimization with Deep Reinforcement Learning0
Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems0
A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation AssuranceCode0
Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks0
Reinforcement Learning for Electricity Network Operation0
Provably Efficient Exploration for Reinforcement Learning Using Unsupervised LearningCode0
Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous0
A General Framework for Learning Mean-Field Games0
Taylor Expansion Policy Optimization0
Optimizing Medical Treatment for Sepsis in Intensive Care: from Reinforcement Learning to Pre-Trial Evaluation0
Option Discovery in the Absence of Rewards with Manifold AnalysisCode0
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based AgentsCode0
Analyzing Visual Representations in Embodied Navigation Tasks0
Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning0
Exploring Unknown States with Action BalanceCode0
Automatic Curriculum Learning For Deep RL: A Short Survey0
Explore and Exploit with Heterotic Line Bundle ModelsCode0
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey0
Mobility Management for Cellular-Connected UAVs: A Learning-Based Approach0
SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks0
Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning0
Show:102550
← PrevPage 449 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified