SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1130111325 of 15113 papers

TitleStatusHype
Reward Shaping for Human Learning via Inverse Reinforcement LearningCode0
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity0
Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning0
Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approachCode0
Optimizing Traffic Lights with Multi-agent Deep Reinforcement Learning and V2X communication0
Wireless 2.0: Towards an Intelligent Radio Environment Empowered by Reconfigurable Meta-Surfaces and Artificial Intelligence0
Near-optimal Regret Bounds for Stochastic Shortest Path0
Rapidly Personalizing Mobile Health Treatment Policies with Limited Data0
Deep Reinforcement Learning with Linear Quadratic Regulator Regions0
Automatic Data Augmentation via Deep Reinforcement Learning for Effective Kidney Tumor Segmentation0
Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion0
Adversarial Radar Inference. From Inverse Tracking to Inverse Reinforcement Learning of Cognitive Radar0
Vehicle Tracking in Wireless Sensor Networks via Deep Reinforcement Learning0
On the Search for Feedback in Reinforcement Learning0
Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution Strategy0
Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep Reinforcement Learning Approach0
Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning0
Adaptive Temporal Difference Learning with Linear Function Approximation0
Automatic Gesture Recognition in Robot-assisted Surgery with Reinforcement Learning and Tree Search0
Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning0
Multi-Agent Reinforcement Learning as a Computational Tool for Language Evolution Research: Historical Context and Future Challenges0
oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions0
Debiased Off-Policy Evaluation for Recommendation Systems0
Multi-Agent Meta-Reinforcement Learning for Self-Powered and Sustainable Edge Computing Systems0
Show:102550
← PrevPage 453 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified