SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 33013325 of 15113 papers

TitleStatusHype
A SUMO Framework for Deep Reinforcement Learning Experiments Solving Electric Vehicle Charging Dispatching Problem0
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control0
A Succinct Summary of Reinforcement Learning0
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems0
Decentralized Federated Reinforcement Learning for User-Centric Dynamic TFDD Control0
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach0
Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks0
Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines0
Deep reinforcement learning for quantum multiparameter estimation0
Decentralized model-free reinforcement learning in stochastic games with average-reward objective0
A Gentle Lecture Note on Filtrations in Reinforcement Learning0
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems0
AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning0
Decentralized Multi-Agent Reinforcement Learning for Task Offloading Under Uncertainty0
Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method0
Decentralized Multi-Robot Formation Control Using Reinforcement Learning0
Decentralized Reinforcement Learning for Multi-Target Search and Detection by a Team of Drones0
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions0
Deep reinforcement learning for RAN optimization and control0
Decentralized Safe Reinforcement Learning for Voltage Control0
Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments0
Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks0
CQM: Curriculum Reinforcement Learning with a Quantized World Model0
Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning0
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning0
Show:102550
← PrevPage 133 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified