SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 69016925 of 15113 papers

TitleStatusHype
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems0
Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning0
Tutorial on Course-of-Action (COA) Attack Search Methods in Computer Networks0
Tutoring Reinforcement Learning via Feedback Control0
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning0
Twisting Lids Off with Two Hands0
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play0
Two Can Play That Game: An Adversarial Evaluation of a Cyber-alert Inspection System0
Two-dimensional Anti-jamming Mobile Communication Based on Reinforcement Learning0
Two geometric input transformation methods for fast online reinforcement learning with neural nets0
Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG0
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks0
Efficiently Training Deep-Learning Parametric Policies using Lagrangian Duality0
Two-stage training algorithm for AI robot soccer0
Two-Step Reinforcement Learning for Multistage Strategy Card Game0
UAS Navigation in the Real World Using Visual Observation0
UAS Visual Navigation in Large and Unseen Environments via a Meta Agent0
UAV aided Metaverse over Wireless Communications: A Reinforcement Learning Approach0
UAV Aided Search and Rescue Operation Using Reinforcement Learning0
UAV-Assisted Coverage Hole Detection Using Reinforcement Learning in Urban Cellular Networks0
UAV-Assisted Enhanced Coverage and Capacity in Dynamic MU-mMIMO IoT Systems: A Deep Reinforcement Learning Approach0
UAV-assisted Semantic Communication with Hybrid Action Reinforcement Learning0
UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations0
UAV Path Planning Employing MPC- Reinforcement Learning Method Considering Collision Avoidance0
UAV Trajectory Planning in Wireless Sensor Networks for Energy Consumption Minimization by Deep Reinforcement Learning0
Show:102550
← PrevPage 277 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified