SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1275112800 of 15113 papers

TitleStatusHype
Neural Program Planner for Structured Predictions0
Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions0
Symbolic Regression Methods for Reinforcement Learning0
Explaining Reinforcement Learning to Mere Mortals: An Empirical Study0
Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction0
DQN with model-based exploration: efficient learning on environments with sparse rewards0
Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement LearningCode1
Improving Safety in Reinforcement Learning Using Model-Based Architectures and Human Intervention0
Jet grooming through reinforcement learningCode0
Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder0
Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus0
End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control TasksCode0
Augmented Memory Networks for Streaming-Based Active One-Shot Learning0
Single-step Options for Adversary Driving0
Optimizing thermodynamic trajectories using evolutionary and gradient-based reinforcement learningCode0
Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation0
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context VariablesCode0
Hindsight Generative Adversarial Imitation Learning0
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL0
Deep Reinforcement Learning with Decorrelation0
A Comparison of Prediction Algorithms and Nexting for Short Term Weather Forecasts0
Adaptive Genomic Evolution of Neural Network Topologies (AGENT) for State-to-Action Mapping in Autonomous Agents0
Learning proposals for sequential importance samplers using reinforced variational inference0
Robust Reinforcement Learning for Autonomous Driving0
Multi-agent query reformulation: Challenges and the role of diversity0
Online Antenna Tuning in Heterogeneous Cellular Networks with Deep Reinforcement Learning0
Policy Distillation and Value Matching in Multiagent Reinforcement Learning0
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement LearningCode0
Can User-Centered Reinforcement Learning Allow a Robot to Attract Passersby without Causing Discomfort?0
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and GazeboCode0
No-regret Exploration in Contextual Reinforcement Learning0
Deep Reinforcement Learning with Feedback-based ExplorationCode0
On Applications of Bootstrap in Continuous Space Reinforcement Learning0
Reinforcement Learning with Dynamic Boltzmann Softmax UpdatesCode0
ROS2Learn: a reinforcement learning framework for ROS 2Code0
CoaCor: Code Annotation for Code Retrieval with Reinforcement LearningCode0
Effective reinforcement learning based local search for the maximum k-plex problem0
Task-oriented Design through Deep Reinforcement Learning0
Resource Abstraction for Reinforcement Learning in Multiagent Congestion Problems0
Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning0
VRKitchen: an Interactive 3D Virtual Environment for Task-oriented LearningCode0
A Review of Reinforcement Learning for Autonomous Building Energy Management0
Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces0
Hybrid Reinforcement Learning with Expert State SequencesCode0
Accelerating Minibatch Stochastic Gradient Descent using Typicality Sampling0
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy CriticsCode0
Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal ControlCode0
Learning to Paint With Model-based Deep Reinforcement LearningCode1
Deep learning for molecular design - a review of the state of the art0
DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning0
Show:102550
← PrevPage 256 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified