SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1160111650 of 15113 papers

TitleStatusHype
Gym-Ignition: Reproducible Robotic Simulations for Reinforcement LearningCode0
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning0
Fully Parameterized Quantile Function for Distributional Reinforcement LearningCode0
A Deep Reinforcement Learning Approach to First-Order Logic Theorem ProvingCode1
Quinoa: a Q-function You Infer Normalized Over Actions0
Robo-advising: Learning Investors' Risk Preferences via Portfolio Choices0
Robotic Tracking Control with Kernel Trick-based Reinforcement Learning0
An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms0
Learning from Trajectories via Subgoal DiscoveryCode0
Non-Cooperative Inverse Reinforcement Learning0
Online Robustness Training for Deep Reinforcement Learning0
Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs0
Maximum Entropy Diverse Exploration: Disentangling Maximum Entropy Reinforcement Learning0
On Solving the 2-Dimensional Greedy Shooter Problem for UAVsCode0
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints0
Explicit Explore-Exploit Algorithms in Continuous State SpacesCode0
Frequentist Regret Bounds for Randomized Least-Squares Value IterationCode0
Generating Formality-Tuned Summaries Using Input-Dependent Rewards0
DIVINE: A Generative Adversarial Imitation Learning Framework for Knowledge Graph Reasoning0
Deep Reinforcement Learning-based Text Anonymization against Private-Attribute Inference0
Incorporating Graph Attention Mechanism into Knowledge Graph Reasoning Based on Deep Reinforcement Learning0
Generalized Speedy Q-learningCode0
Learning the Extraction Order of Multiple Relational Facts in a Sentence with Reinforcement Learning0
A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive Disturbances0
Exploring Diverse Expressions for Paraphrase Generation0
Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning0
Positive-Unlabeled Reward Learning0
Neural Topic Model with Reinforcement Learning0
Answer-Supervised Question Reformulation for Enhancing Conversational Machine Comprehension0
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering0
Hierarchical Expert Networks for Meta-Learning0
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement LearningCode1
VASE: Variational Assorted Surprise Exploration for Reinforcement Learning0
RLINK: Deep Reinforcement Learning for User Identity Linkage0
Cascaded LSTMs based Deep Reinforcement Learning for Goal-driven DialogueCode0
Learning Algorithmic Solutions to Symbolic Planning Tasks with a Neural Computer Architecture0
DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning0
A Distributed Model-Free Algorithm for Multi-hop Ride-sharing using Deep Reinforcement Learning0
RBED: Reward Based Epsilon Decay0
Policy Continuation with Hindsight Inverse DynamicsCode0
Multimodal Model-Agnostic Meta-Learning via Task-Aware ModulationCode1
Deep reinforcement learning for market making in corporate bonds: beating the curse of dimensionality0
Deep Reinforcement Learning for Distributed Uncoordinated Cognitive Radios Resource Allocation0
Adaptive Sampling Quasi-Newton Methods for Derivative-Free Stochastic Optimization0
Feedback Linearization for Unknown Systems via Reinforcement Learning0
Learning to Manipulate Deformable Objects without DemonstrationsCode1
Navigation Agents for the Visually Impaired: A Sidewalk Simulator and ExperimentsCode0
Deep Decentralized Reinforcement Learning for Cooperative Control0
Overcoming Catastrophic Interference in Online Reinforcement Learning with Dynamic Self-Organizing Maps0
Robust Model-free Reinforcement Learning with Multi-objective Bayesian Optimization0
Show:102550
← PrevPage 233 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified