SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1120111250 of 15113 papers

TitleStatusHype
Learning to Fly via Deep Model-Based Reinforcement LearningCode0
Adjust Planning Strategies to Accommodate Reinforcement Learning AgentsCode0
Generating Socially Acceptable Perturbations for Efficient Evaluation of Autonomous Vehicles0
Viewport-Aware Deep Reinforcement Learning Approach for 360^o Video Caching0
Placement Optimization with Deep Reinforcement Learning0
Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems0
A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation AssuranceCode0
Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks0
Reinforcement Learning for Electricity Network Operation0
Provably Efficient Exploration for Reinforcement Learning Using Unsupervised LearningCode0
Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous0
A General Framework for Learning Mean-Field Games0
Taylor Expansion Policy Optimization0
Optimizing Medical Treatment for Sepsis in Intensive Care: from Reinforcement Learning to Pre-Trial Evaluation0
Option Discovery in the Absence of Rewards with Manifold AnalysisCode0
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based AgentsCode0
Analyzing Visual Representations in Embodied Navigation Tasks0
Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning0
Exploring Unknown States with Action BalanceCode0
Automatic Curriculum Learning For Deep RL: A Short Survey0
Explore and Exploit with Heterotic Line Bundle ModelsCode0
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey0
Mobility Management for Cellular-Connected UAVs: A Learning-Based Approach0
SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks0
Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning0
Privacy-Cost Management in Smart Meters Using Deep Reinforcement Learning0
Reinforcement Learning for Mitigating Intermittent Interference in Terahertz Communication Networks0
Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces0
Transfer Reinforcement Learning under Unobserved Contextual Information0
Q* Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison0
Stable Policy Optimization via Off-Policy Divergence RegularizationCode0
Advancing Renewable Electricity Consumption With Reinforcement Learning0
Human AI interaction loop training: New approach for interactive reinforcement learning0
Deep Adversarial Reinforcement Learning for Object Disentangling0
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate0
Reinforcement Learning Based Cooperative Coded Caching under Dynamic Popularities in Ultra-Dense Networks0
Reinforcement Learning for Combinatorial Optimization: A Survey0
Convergence of Q-value in case of Gaussian rewards0
Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning0
Lane-Merging Using Policy-based Reinforcement Learning and Post-Optimization0
Smart Train Operation Algorithms based on Expert Knowledge and Reinforcement Learning0
Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing0
Distributional Robustness and Regularization in Reinforcement Learning0
Efficient and Effective Similar Subtrajectory Search with Deep Reinforcement Learning0
A Geometric Perspective on Visual Imitation Learning0
Deep Reinforcement Learning-BasedRobust Protection in DER-Rich Distribution Grids0
Dynamic Experience Replay0
Efficient statistical validation with edge cases to evaluate Highly Automated Vehicles0
Neural-Network Heuristics for Adaptive Bayesian Quantum Estimation0
Privacy-Aware Time-Series Data Sharing with Deep Reinforcement Learning0
Show:102550
← PrevPage 225 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified