SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1122611250 of 15113 papers

TitleStatusHype
Privacy-Cost Management in Smart Meters Using Deep Reinforcement Learning0
Reinforcement Learning for Mitigating Intermittent Interference in Terahertz Communication Networks0
Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces0
Transfer Reinforcement Learning under Unobserved Contextual Information0
Q* Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison0
Stable Policy Optimization via Off-Policy Divergence RegularizationCode0
Advancing Renewable Electricity Consumption With Reinforcement Learning0
Human AI interaction loop training: New approach for interactive reinforcement learning0
Deep Adversarial Reinforcement Learning for Object Disentangling0
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate0
Reinforcement Learning Based Cooperative Coded Caching under Dynamic Popularities in Ultra-Dense Networks0
Reinforcement Learning for Combinatorial Optimization: A Survey0
Convergence of Q-value in case of Gaussian rewards0
Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning0
Lane-Merging Using Policy-based Reinforcement Learning and Post-Optimization0
Smart Train Operation Algorithms based on Expert Knowledge and Reinforcement Learning0
Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing0
Distributional Robustness and Regularization in Reinforcement Learning0
Efficient and Effective Similar Subtrajectory Search with Deep Reinforcement Learning0
A Geometric Perspective on Visual Imitation Learning0
Deep Reinforcement Learning-BasedRobust Protection in DER-Rich Distribution Grids0
Dynamic Experience Replay0
Efficient statistical validation with edge cases to evaluate Highly Automated Vehicles0
Neural-Network Heuristics for Adaptive Bayesian Quantum Estimation0
Privacy-Aware Time-Series Data Sharing with Deep Reinforcement Learning0
Show:102550
← PrevPage 450 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified