SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 72267250 of 15113 papers

TitleStatusHype
Multi-Agent Broad Reinforcement Learning for Intelligent Traffic Light Control0
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning0
Designing Heterogeneous GNNs with Desired Permutation Properties for Wireless Resource Allocation0
A Complete Characterization of Linear Estimators for Offline Policy Evaluation0
Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery0
Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping0
Knowledge Transfer in Deep Reinforcement Learning for Slice-Aware Mobility Robustness Optimization0
Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations0
Cascaded Gaps: Towards Gap-Dependent Regret for Risk-Sensitive Reinforcement Learning0
A Survey on Reinforcement Learning Methods in Character Animation0
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets0
Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network0
Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation0
Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility0
Black-Box Safety Validation of Autonomous Systems: A Multi-Fidelity Reinforcement Learning Approach0
Reinforcement Learning for Location-Aware Scheduling0
On Credit Assignment in Hierarchical Reinforcement LearningCode0
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning0
Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit0
Watch from sky: machine-learning-based multi-UAV network for predictive police surveillance0
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment0
Deep Reinforcement Learning based Model-free On-line Dynamic Multi-Microgrid Formation to Enhance Resilience0
Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations0
Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation0
A Multi-Document Coverage Reward for RELAXed Multi-Document SummarizationCode0
Show:102550
← PrevPage 290 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified