SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 997610000 of 15113 papers

TitleStatusHype
Multi-Agent Reinforcement Learning in Cournot Games0
Multiagent Reinforcement Learning in Games with an Iterated Dominance Solution0
Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for Cellular Offloading0
Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation0
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization0
Multi-agent Reinforcement Learning with Sparse Interactions by Negotiation and Knowledge Transfer0
Multi-Agent Reinforcement Learning with Shared Resource in Inventory Management0
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management0
Multi-Agent Reinforcement Learning with Common Policy for Antenna Tilt Optimization0
Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing0
Multi-Agent Reinforcement Learning with Graph Convolutional Neural Networks for optimal Bidding Strategies of Generation Units in Electricity Markets0
Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna Tuning0
Multi-Agent Reinforcement Learning with Multi-Step Generative Models0
Multi-Agent Safe Policy Learning for Power Management of Networked Microgrids0
Emergent Social Learning via Multi-agent Reinforcement Learning0
Multiagent Soft Q-Learning0
Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective0
Multi-Agent Transfer Learning in Reinforcement Learning-Based Ride-Sharing Systems0
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning0
Multi-Armed Bandits and Quantum Channel Oracles0
Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement Learning0
Multi-batch Reinforcement Learning via Sample Transfer and Imitation Learning0
Multibit Tries Packet Classification with Deep Reinforcement Learning0
Multi-Class Multi-Annotator Active Learning With Robust Gaussian Process for Visual Recognition0
Multi-compartment Neuron and Population Encoding Powered Spiking Neural Network for Deep Distributional Reinforcement Learning0
Show:102550
← PrevPage 400 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified