SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 39764000 of 15113 papers

TitleStatusHype
Fast deep reinforcement learning using online adjustments from the pastCode0
Fast Rates for Maximum Entropy ExplorationCode0
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue SystemsCode0
Fast, Accurate and Lightweight Super-Resolution with Neural Architecture SearchCode0
Fairness Through Counterfactual UtilitiesCode0
Action-Decision Networks for Visual Tracking With Deep Reinforcement LearningCode0
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement LearningCode0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documentsCode0
Shaping Advice in Deep Multi-Agent Reinforcement LearningCode0
Detecting Spiky Corruption in Markov Decision ProcessesCode0
Bayesian Curiosity for Efficient Exploration in Reinforcement LearningCode0
FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning AgentsCode0
External Model Motivated Agents: Reinforcement Learning for Enhanced Environment SamplingCode0
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from ObservationsCode0
A Comparison of Reinforcement Learning Frameworks for Software Testing TasksCode0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
Extending Environments To Measure Self-Reflection In Reinforcement LearningCode0
Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic FunctionsCode0
Extended Markov Games to Learn Multiple Tasks in Multi-Agent Reinforcement LearningCode0
Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option TemplatesCode0
Case-Based Inverse Reinforcement Learning Using Temporal CoherenceCode0
Sim-Env: Decoupling OpenAI Gym Environments from Simulation ModelsCode0
SimpleDS: A Simple Deep Reinforcement Learning Dialogue SystemCode0
Exponential Family Model-Based Reinforcement Learning via Score MatchingCode0
Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic TasksCode0
Show:102550
← PrevPage 160 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified