SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1075110800 of 15113 papers

TitleStatusHype
Cooperative Perception with Deep Reinforcement Learning for Connected Vehicles0
Model-Based Meta-Reinforcement Learning for Flight with Suspended PayloadsCode1
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning0
Flexible and Efficient Long-Range Planning Through Curious Exploration0
Chip Placement with Deep Reinforcement LearningCode1
AutoEG: Automated Experience Grafting for Off-Policy Deep Reinforcement Learning0
Sequential Anomaly Detection using Inverse Reinforcement Learning0
Tactical Decision-Making in Autonomous Driving by Reinforcement Learning with Uncertainty EstimationCode1
SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning0
Reinforcement Learning to Optimize the Logistics Distribution Routes of Unmanned Aerial Vehicle0
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition0
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning0
Self-Guided Evolution Strategies with Historical Estimated GradientsCode0
Tightening Exploration in Upper Confidence Reinforcement Learning0
Learning as Reinforcement: Applying Principles of Neuroscience for More General Reinforcement Learning Agents0
Data-Driven Learning and Load Ensemble Control0
Energy-Based Imitation LearningCode1
Attention Routing: track-assignment detailed routing using attention-based reinforcement learning0
Variational Policy Propagation for Multi-agent Reinforcement Learning0
Superkernel Neural Architecture Search for Image Denoising0
Modeling Survival in model-based Reinforcement Learning0
Macro-Action-Based Deep Multi-Agent Reinforcement Learning0
Time Adaptive Reinforcement Learning0
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning0
Deep Reinforcement Learning for Adaptive Learning Systems0
Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning0
Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation0
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion0
Show Us the Way: Learning to Manage Dialog from Demonstrations0
MARLeME: A Multi-Agent Reinforcement Learning Model Extraction LibraryCode1
Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions0
OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence GenerationCode0
Data-Driven Robust Control Using Reinforcement Learning0
A Game Theoretic Framework for Model Based Reinforcement Learning0
Fast Template Matching and Update for Video Object Tracking and SegmentationCode1
Analyzing Reinforcement Learning Benchmarks with Random Weight GuessingCode0
Continual Reinforcement Learning with Multi-Timescale ReplayCode1
Safe deep reinforcement learning-based constrained optimal control scheme for active distribution networks0
Extending Deep Reinforcement Learning Frameworks in Cryptocurrency Market Making0
D4RL: Datasets for Deep Data-Driven Reinforcement LearningCode2
Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning0
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos0
Zero-Shot Compositional Policy Learning via Language GroundingCode1
Prolog Technology Reinforcement Learning ProverCode1
Reinforcement Learning in a Physics-Inspired Semi-Markov EnvironmentCode0
Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems0
Reinforcement Learning Approach to Vibration Compensation for Dynamic Feed Drive Systems0
A Demonstration of Issues with Value-Based Multiobjective Reinforcement Learning Under Stochastic State Transitions0
Extrapolation in Gridworld Markov-Decision Processes0
A reinforcement learning application of guided Monte Carlo Tree Search algorithm for beam orientation selection in radiation therapy0
Show:102550
← PrevPage 216 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified