SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1105111075 of 15113 papers

TitleStatusHype
Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning0
Meta-Reinforcement Learning for Robotic Industrial Insertion Tasks0
The Immersion of Directed Multi-graphs in Embedding Fields. Generalisations0
Improving Sample Efficiency and Multi-Agent Communication in RL-based Train Rescheduling0
Age-Aware Status Update Control for Energy Harvesting IoT Sensors via Reinforcement Learning0
Can We Learn Heuristics For Graphical Model Inference Using Reinforcement Learning?0
Adaptive model selection in photonic reservoir computing by reinforcement learning0
Evolving Inborn Knowledge For Fast Adaptation in Dynamic POMDP ProblemsCode0
The Ingredients of Real-World Robotic Reinforcement Learning0
Reinforcement Learning Generalization with Surprise MinimizationCode0
A State Aggregation Approach for Solving Knapsack Problem with Deep Reinforcement Learning0
Automatic low-bit hybrid quantization of neural networks through meta learning0
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning0
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning0
Guiding Robot Exploration in Reinforcement Learning via Automated Planning0
Cooperative Perception with Deep Reinforcement Learning for Connected Vehicles0
Learning Dialog Policies from Weak Demonstrations0
Correct Me If You Can: Learning from Error Corrections and MarkingsCode0
Flexible and Efficient Long-Range Planning Through Curious Exploration0
AutoEG: Automated Experience Grafting for Off-Policy Deep Reinforcement Learning0
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning0
Sequential Anomaly Detection using Inverse Reinforcement Learning0
Reinforcement Learning to Optimize the Logistics Distribution Routes of Unmanned Aerial Vehicle0
SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning0
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning0
Show:102550
← PrevPage 443 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified