SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
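As a rough illustration of the loop described above, here is a minimal sketch using tabular Q-learning, one of many possible RL methods. The agent acts, receives a reward, and updates its estimate of long-term return. The five-state chain environment, the `step`, `train`, and `greedy_policy` helpers, and all hyperparameter values are hypothetical choices made for this example. None of them is taken from any paper listed on this page.

```python
import random

# Hypothetical toy environment: a chain of states 0..4, where state 4 is terminal.
N_STATES = 5
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Apply an action; reward 1.0 is given only on reaching the last state."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (assumed settings)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.choice(ACTIONS)
            else:
                # Exploit: pick a best-valued action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
```

Calling `greedy_policy(train())` returns one action per non-terminal state. In this toy chain the only reward lies at the right end, so a learned policy that maximizes long-term reward should move right from every state.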

Papers

Showing 8826-8850 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| Reverb: A Framework For Experience Replay | Code | 1 |
| Continuous-Time Model-Based Reinforcement Learning | Code | 1 |
| Learning State Representations from Random Deep Action-conditional Predictions | Code | 0 |
| Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning | | 0 |
| Introduction to Machine Learning for the Sciences | | 0 |
| Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning | Code | 1 |
| Generate and Revise: Reinforcement Learning in Neural Poetry | | 0 |
| Learning Optimal Strategies for Temporal Tasks in Stochastic Games | | 0 |
| Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature | | 0 |
| Neurogenetic Programming Framework for Explainable Reinforcement Learning | Code | 0 |
| RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads | Code | 1 |
| Unlocking Pixels for Reinforcement Learning via Implicit Attention | | 0 |
| Sparsely ensembled convolutional neural network classifiers via reinforcement learning | Code | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | | 0 |
| Tactical Optimism and Pessimism for Deep Reinforcement Learning | Code | 1 |
| Multi-Agent Deep Reinforcement Learning for Request Dispatching in Distributed-Controller Software-Defined Networking | | 0 |
| LongiControl: A Reinforcement Learning Environment for Longitudinal Vehicle Control | Code | 1 |
| Explainable Reinforcement Learning for Longitudinal Control | Code | 1 |
| A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum | | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | | 0 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Code | 1 |
| Improving Model and Search for Computer Go | | 0 |
| A bandit approach to curriculum generation for automatic speech recognition | | 0 |
| Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning | | 0 |
| Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |