SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 34763500 of 15113 papers

TitleStatusHype
Automata-Guided Hierarchical Reinforcement Learning for Skill Composition0
Deep Q-Learning for Directed Acyclic Graph Generation0
AUTOMATA GUIDED HIERARCHICAL REINFORCEMENT LEARNING FOR ZERO-SHOT SKILL COMPOSITION0
Counterfactual Explanation Policies in RL0
Automata Guided Reinforcement Learning With Demonstrations0
Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market0
Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task0
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment0
Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents0
Deep Q-Network (DQN) multi-agent reinforcement learning (MARL) for Stock Trading0
Deep Q-Network for AI Soccer0
A Strong Baseline for Batch Imitation Learning0
Counterfactual Credit Assignment in Model-Free Reinforcement Learning0
A physics-informed reinforcement learning approach for the interfacial area transport in two-phase flow0
Agent based modelling for continuously varying supply chains0
Automated Database Indexing using Model-free Reinforcement Learning0
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning0
Deep Randomized Least Squares Value Iteration0
Deep Radial-Basis Value Functions for Continuous Control0
Accelerating the Computation of UCB and Related Indices for Reinforcement Learning0
Deep reinforced active learning for multi-class image classification0
Deep Reinforced Self-Attention Masks for Abstractive Summarization (DR.SAS)0
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification0
Deep Reinforcement Fuzzing0
Deep RL With Information Constrained Policies: Generalization in Continuous Control0
Show:102550
← PrevPage 140 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified