SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 64516475 of 15113 papers

TitleStatusHype
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems0
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics0
Active Reinforcement Learning -- A Roadmap Towards Curious Classifier Systems for Self-Adaptation0
In Defense of the Unitary Scalarization for Deep Multi-Task LearningCode1
STIR^2: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks0
Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making0
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph0
State of the Art of User Simulation approaches for conversational information retrieval0
Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Control0
When is Offline Two-Player Zero-Sum Markov Game Solvable?0
Verified Probabilistic Policies for Deep Reinforcement LearningCode1
A Multi-agent Reinforcement Learning Approach for Efficient Client Selection in Federated Learning0
Assessing Policy, Loss and Planning Combinations in Reinforcement Learning using a New Modular Architecture0
Mirror Learning: A Unifying Framework of Policy OptimisationCode1
Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations0
Offline Reinforcement Learning for Road Traffic Control0
SABLAS: Learning Safe Control for Black-box Dynamical SystemsCode1
Combining Reinforcement Learning and Inverse Reinforcement Learning for Asset Allocation Recommendations0
Sample Efficient Deep Reinforcement Learning via Uncertainty EstimationCode1
Offsetting Unequal Competition through RL-assisted Incentive Schemes0
Using Simulation Optimization to Improve Zero-shot Policy Transfer of QuadrotorsCode1
Deep Reinforcement Learning, a textbook0
Deep Learning-based Predictive Control of Battery Management for Frequency RegulationCode0
Learning Complex Spatial Behaviours in ABM: An Experimental Observational Study0
Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning0
Show:102550
← PrevPage 259 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified