SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 25512575 of 15113 papers

TitleStatusHype
Automated Proof of Polynomial Inequalities via Reinforcement LearningCode0
Automated Optical Multi-layer Design via Deep Reinforcement LearningCode0
Adaptive Estimator Selection for Off-Policy EvaluationCode0
Join Query Optimization with Deep Reinforcement Learning AlgorithmsCode0
Automated Image Data Preprocessing with Deep Reinforcement LearningCode0
Iterative Reward Shaping using Human Feedback for Correcting Reward MisspecificationCode0
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of InterestingnessCode0
Jet grooming through reinforcement learningCode0
Jointly Learning to Construct and Control Agents using Deep Reinforcement LearningCode0
Automated Gadget Discovery in ScienceCode0
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?Code0
Automated Discovery of Local Rules for Desired Collective-Level Behavior Through Reinforcement LearningCode0
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for HanabiCode0
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing fieldCode0
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical GuaranteesCode0
Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement LearningCode0
Automated Curriculum Learning by Rewarding Temporally Rare EventsCode0
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement LearningCode0
Automata Learning meets ShieldingCode0
Inverse Reinforcement Learning in Contextual MDPsCode0
Adaptive Discretization for Model-Based Reinforcement LearningCode0
Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement LearningCode0
Adaptive Discretization for Episodic Reinforcement Learning in Metric SpacesCode0
Inverse reinforcement learning for video gamesCode0
IRLAS: Inverse Reinforcement Learning for Architecture SearchCode0
Show:102550
← PrevPage 103 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified