SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1450114550 of 15113 papers

TitleStatusHype
Exploration--Exploitation in MDPs with Options0
Cohesion-based Online Actor-Critic Reinforcement Learning for mHealth Intervention0
Unsupervised Basis Function Adaptation for Reinforcement Learning0
Fake News Mitigation via Point Process Based Intervention0
Faster Reinforcement Learning Using Active SimulatorsCode0
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement LearningCode0
Deep Exploration via Randomized Value Functions0
Investigation of Language Understanding Impact for Reinforcement Learning Based Dialogue Systems0
Pseudorehearsal in value function approximation0
Black-Box Data-efficient Policy Search for RoboticsCode0
Learning Cooperative Visual Dialog Agents with Deep Reinforcement LearningCode1
Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options0
Online Learning for Offloading and Autoscaling in Energy Harvesting Mobile Edge Computing0
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability0
Particle Value Functions0
Using Reinforcement Learning for Demand Response of Domestic Hot Water Buffers: a Real-Life Demonstration0
Minimax Regret Bounds for Reinforcement LearningCode0
Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning0
Sensor Fusion for Robot Control through Deep Reinforcement Learning0
Reinforcement Learning for Transition-Based Mention Detection0
A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning0
Micro-Objective Learning : Accelerating Deep Reinforcement Learning through the Discovery of Continuous Subgoals0
Communications that Emerge through Reinforcement Learning Using a (Recurrent) Neural Network0
Evolution Strategies as a Scalable Alternative to Reinforcement LearningCode1
Sample Efficient Feature Selection for Factored MDPs0
What can you do with a rock? Affordance extraction via word embeddings0
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksCode1
Robust Adversarial Reinforcement LearningCode1
Tree-Structured Reinforcement Learning for Sequential Object Localization0
Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning0
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute DetectionCode0
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents0
Functions that Emerge through End-to-End Reinforcement Learning - The Direction for Artificial General Intelligence -0
Neural Episodic ControlCode0
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning0
Third-Person Imitation LearningCode0
Unsupervised Basis Function Adaptation for Reinforcement Learning0
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction0
FeUdal Networks for Hierarchical Reinforcement LearningCode0
Generalised Discount Functions applied to a Monte-Carlo AImu ImplementationCode0
Actor-Critic Reinforcement Learning with Simultaneous Human Control and Feedback0
EX2: Exploration with Exemplar Models for Deep Reinforcement LearningCode0
Multi-step Reinforcement Learning: A Unifying Algorithm0
A Laplacian Framework for Option Discovery in Reinforcement LearningCode0
Reinforcement Learning for Pivoting TaskCode0
Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless NavigationCode1
Learning to Optimize Neural Nets0
Show, Attend and Interact: Perceivable Human-Robot Social Interaction through Neural Attention Q-Network0
Analysing Congestion Problems in Multi-agent Reinforcement Learning0
Analysis of Agent Expertise in Ms. Pac-Man using Value-of-Information-based Policies0
Show:102550
← PrevPage 291 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified