SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1340113450 of 15113 papers

TitleStatusHype
Investigating Enactive Learning for Autonomous Intelligent Agents0
Continual State Representation Learning for Reinforcement Learning using Generative Replay0
Distributed Wildfire Surveillance with Autonomous Aircraft using Deep Reinforcement Learning0
Actor-Critic Deep Reinforcement Learning for Dynamic Multichannel Access0
Multi-agent Deep Reinforcement Learning for Zero Energy Communities0
SFV: Reinforcement Learning of Physical Skills from VideosCode0
Reinforcement Evolutionary Learning Method for self-learning0
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural NetworksCode0
PPO-CMA: Proximal Policy Optimization with Covariance Matrix AdaptationCode0
MyCaffe: A Complete C# Re-Write of Caffe with Reinforcement LearningCode0
Learning Scheduling Algorithms for Data Processing ClustersCode0
Deep Reinforcement Learning for Time Scheduling in RF-Powered Backscatter Cognitive Radio Networks0
Comparison of Reinforcement Learning algorithms applied to the Cart Pole problemCode0
Efficient Dialog Policy Learning via Positive Memory RetentionCode0
Energy-Based Hindsight Experience PrioritizationCode0
EMI: Exploration with Mutual InformationCode0
Near-Optimal Representation Learning for Hierarchical Reinforcement LearningCode0
The Dreaming Variational Autoencoder for Reinforcement Learning EnvironmentsCode0
Reinforcement Learning with Perturbed RewardsCode0
Prediction Improves Simultaneous Neural Machine Translation0
SmartChoices: Hybridizing Programming and Machine Learning0
Automatic Essay Scoring Incorporating Rating Schema via Reinforcement Learning0
Autonomous Sub-domain Modeling for Dialogue Policy with Hierarchical Deep Reinforcement Learning0
A Teacher-Student Framework for Maintainable Dialog Manager0
Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management0
Logician and Orator: Learning from the Duality between Language and Knowledge in Open Domain0
Adaptive Multi-pass Decoder for Neural Machine Translation0
Automatic Poetry Generation with Mutual Reinforcement Learning0
Bayesian Transfer Reinforcement Learning with Prior Knowledge Rules0
Deep Quality-Value (DQV) LearningCode0
Few-Shot Goal Inference for Visuomotor Learning and Planning0
Learning to Perform Local Rewriting for Combinatorial OptimizationCode0
Using State Predictions for Value Regularization in Curiosity Driven Deep Reinforcement LearningCode0
Reinforcement Learning in R0
M^3RL: Mind-aware Multi-agent Management Reinforcement LearningCode0
Generalization and Regularization in DQNCode0
Direct optimization of F-measure for retrieval-based personal question answering0
Robot Representation and Reasoning with Knowledge from Reinforcement Learning0
Transfer Value or Policy? A Value-centric Framework Towards Transferrable Continuous Reinforcement Learning0
Towards More Theoretically-Grounded Particle Optimization Sampling for Deep Learning0
Mimicking actions is a good strategy for beginners: Fast Reinforcement Learning with Expert Action Sequences0
What Would pi* Do?: Imitation Learning via Off-Policy Reinforcement Learning0
Where Off-Policy Deep Reinforcement Learning Fails0
Successor Options : An Option Discovery Algorithm for Reinforcement Learning0
The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions0
Policy Generalization In Capacity-Limited Reinforcement Learning0
Shrinkage-based Bias-Variance Trade-off for Deep Reinforcement Learning0
Unsupervised Exploration with Deep Model-Based Reinforcement Learning0
Interactive Parallel Exploration for Reinforcement Learning in Continuous Action Spaces0
COLLABORATIVE MULTIAGENT REINFORCEMENT LEARNING IN HOMOGENEOUS SWARMS0
Show:102550
← PrevPage 269 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified