Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2026–2050 of 15113 papers

Title	Date	Tasks	Status	Hype
Learning When and Where to Zoom with Deep Reinforcement Learning	Mar 1, 2020	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Analysis of diversity-accuracy tradeoff in image captioning	Feb 27, 2020	DiversityImage Captioning	CodeCode Available	1
Optimistic Exploration even with a Pessimistic Initialisation	Feb 26, 2020	Efficient ExplorationQ-Learning	CodeCode Available	1
Using Reinforcement Learning in the Algorithmic Trading Problem	Feb 26, 2020	Algorithmic Tradingreinforcement-learning	CodeCode Available	1
Whole-Body Control of a Mobile Manipulator using End-to-End Reinforcement Learning	Feb 25, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement	Feb 25, 2020	Reinforcement LearningReinforcement Learning (RL)	CodeCode Available	1
Reconfigurable Intelligent Surface Assisted Multiuser MISO Systems Exploiting Deep Reinforcement Learning	Feb 24, 2020	Deep Reinforcement LearningNeural Network simulation	CodeCode Available	1
Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations	Feb 23, 2020	Atari GamesDecision Making	CodeCode Available	1
Reinforcement Learning Framework for Deep Brain Stimulation Study	Feb 22, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
How To Avoid Being Eaten By a Grue: Exploration Strategies for Text-Adventure Agents	Feb 19, 2020	Knowledge Graphsreinforcement-learning	CodeCode Available	1
Sim2Real Transfer for Reinforcement Learning without Dynamics Randomization	Feb 19, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics	Feb 18, 2020	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Generating Automatic Curricula via Self-Supervised Active Domain Randomization	Feb 18, 2020	Reinforcement LearningReinforcement Learning (RL)	CodeCode Available	1
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking	Feb 17, 2020	Gaussian ProcessesReinforcement Learning	CodeCode Available	1
R-MADDPG for Partially Observable Environments and Limited Communication	Feb 16, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
Reinforced active learning for image segmentation	Feb 16, 2020	Active LearningDeep Reinforcement Learning	CodeCode Available	1
PDDLGym: Gym Environments from PDDL Problems	Feb 15, 2020	Decision MakingOpenAI Gym	CodeCode Available	1
Deep RL Agent for a Real-Time Action Strategy Game	Feb 15, 2020	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality	Feb 14, 2020	Inductive BiasMetric Learning	CodeCode Available	1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription	Feb 13, 2020	Decision Makingreinforcement-learning	CodeCode Available	1
Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems	Feb 13, 2020	Distributed Computingreinforcement-learning	CodeCode Available	1
Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization	Feb 11, 2020	Combinatorial OptimizationHyperparameter Optimization	CodeCode Available	1
Objective Mismatch in Model-based Reinforcement Learning	Feb 11, 2020	modelModel-based Reinforcement Learning	CodeCode Available	1
SparseIDS: Learning Packet Sampling with Reinforcement Learning	Feb 10, 2020	Computational EfficiencyEdge-computing	CodeCode Available	1
Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States	Feb 9, 2020	ArticlesManagement	CodeCode Available	1

Show:10 25 50

← PrevPage 82 of 605Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified