SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1105111100 of 15113 papers

TitleStatusHype
Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning0
Meta-Reinforcement Learning for Robotic Industrial Insertion Tasks0
The Immersion of Directed Multi-graphs in Embedding Fields. Generalisations0
Improving Sample Efficiency and Multi-Agent Communication in RL-based Train Rescheduling0
Age-Aware Status Update Control for Energy Harvesting IoT Sensors via Reinforcement Learning0
Can We Learn Heuristics For Graphical Model Inference Using Reinforcement Learning?0
Adaptive model selection in photonic reservoir computing by reinforcement learning0
Evolving Inborn Knowledge For Fast Adaptation in Dynamic POMDP ProblemsCode0
The Ingredients of Real-World Robotic Reinforcement Learning0
Reinforcement Learning Generalization with Surprise MinimizationCode0
A State Aggregation Approach for Solving Knapsack Problem with Deep Reinforcement Learning0
Automatic low-bit hybrid quantization of neural networks through meta learning0
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning0
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning0
Guiding Robot Exploration in Reinforcement Learning via Automated Planning0
Cooperative Perception with Deep Reinforcement Learning for Connected Vehicles0
Learning Dialog Policies from Weak Demonstrations0
Correct Me If You Can: Learning from Error Corrections and MarkingsCode0
Flexible and Efficient Long-Range Planning Through Curious Exploration0
AutoEG: Automated Experience Grafting for Off-Policy Deep Reinforcement Learning0
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning0
Sequential Anomaly Detection using Inverse Reinforcement Learning0
Reinforcement Learning to Optimize the Logistics Distribution Routes of Unmanned Aerial Vehicle0
SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning0
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning0
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition0
Learning as Reinforcement: Applying Principles of Neuroscience for More General Reinforcement Learning Agents0
Data-Driven Learning and Load Ensemble Control0
Attention Routing: track-assignment detailed routing using attention-based reinforcement learning0
Tightening Exploration in Upper Confidence Reinforcement Learning0
Self-Guided Evolution Strategies with Historical Estimated GradientsCode0
Superkernel Neural Architecture Search for Image Denoising0
Variational Policy Propagation for Multi-agent Reinforcement Learning0
Macro-Action-Based Deep Multi-Agent Reinforcement Learning0
Time Adaptive Reinforcement Learning0
Modeling Survival in model-based Reinforcement Learning0
Show Us the Way: Learning to Manage Dialog from Demonstrations0
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning0
Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation0
Deep Reinforcement Learning for Adaptive Learning Systems0
Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning0
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion0
Data-Driven Robust Control Using Reinforcement Learning0
A Game Theoretic Framework for Model Based Reinforcement Learning0
Analyzing Reinforcement Learning Benchmarks with Random Weight GuessingCode0
OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence GenerationCode0
Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions0
Reinforcement Learning in a Physics-Inspired Semi-Markov EnvironmentCode0
Safe deep reinforcement learning-based constrained optimal control scheme for active distribution networks0
Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning0
Show:102550
← PrevPage 222 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified