Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1601–1650 of 15113 papers

Title	Date	Tasks	Status	Hype
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing	Feb 15, 2021	Deep Reinforcement LearningMulti-agent Reinforcement Learning	CodeCode Available	1
LTL2Action: Generalizing LTL Instructions for Multi-Task RL	Feb 13, 2021	Deep Reinforcement LearningDiversity	CodeCode Available	1
Scalable Bayesian Inverse Reinforcement Learning	Feb 12, 2021	Bayesian InferenceImitation Learning	CodeCode Available	1
Multi-Task Reinforcement Learning with Context-based Representations	Feb 11, 2021	Multi-Task Learningreinforcement-learning	CodeCode Available	1
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision	Feb 10, 2021	Board GamesModel-based Reinforcement Learning	CodeCode Available	1
Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation	Feb 10, 2021	Autonomous DrivingDeep Reinforcement Learning	CodeCode Available	1
Risk-Averse Offline Reinforcement Learning	Feb 10, 2021	reinforcement-learningReinforcement Learning	CodeCode Available	1
Reverb: A Framework For Experience Replay	Feb 9, 2021	Reinforcement Learning (RL)	CodeCode Available	1
rl_reach: Reproducible Reinforcement Learning Experiments for Robotic Reaching Tasks	Feb 9, 2021	reinforcement-learningReinforcement Learning	CodeCode Available	1
Continuous-Time Model-Based Reinforcement Learning	Feb 9, 2021	modelModel-based Reinforcement Learning	CodeCode Available	1
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads	Feb 8, 2021	CPUDeep Reinforcement Learning	CodeCode Available	1
Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning	Feb 8, 2021	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	1
Tactical Optimism and Pessimism for Deep Reinforcement Learning	Feb 7, 2021	continuous-controlContinuous Control	CodeCode Available	1
Explainable Reinforcement Learning for Longitudinal Control	Feb 6, 2021	Deep Reinforcement LearningOpenAI Gym	CodeCode Available	1
LongiControl: A Reinforcement Learning Environment for Longitudinal Vehicle Control	Feb 6, 2021	Autonomous DrivingOpenAI Gym	CodeCode Available	1
Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning	Feb 6, 2021	Multi-agent Reinforcement Learningreinforcement-learning	CodeCode Available	1
Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning	Feb 5, 2021	Graph EmbeddingModel Compression	CodeCode Available	1
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents	Feb 4, 2021	Meta-LearningMeta Reinforcement Learning	CodeCode Available	1
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning	Feb 1, 2021	Offline RLreinforcement-learning	CodeCode Available	1
Multi-Agent Reinforcement Learning with Temporal Logic Specifications	Feb 1, 2021	Multi-agent Reinforcement Learningreinforcement-learning	CodeCode Available	1
Contextualized Rewriting for Text Summarization	Jan 31, 2021	Extractive Summarizationreinforcement-learning	CodeCode Available	1
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies	Jan 24, 2021	Acrobotreinforcement-learning	CodeCode Available	1
Differentiable Trust Region Layers for Deep Reinforcement Learning	Jan 22, 2021	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary	Jan 21, 2021	Adversarial Attackcontinuous-control	CodeCode Available	1
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis Treatment	Jan 21, 2021	Clinical KnowledgeDecision Making Under Uncertainty	CodeCode Available	1
mt5se: An Open Source Framework for Building Autonomous Trading Robots	Jan 20, 2021	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers	Jan 20, 2021	Multi-agent Reinforcement Learningreinforcement-learning	CodeCode Available	1
Towards Facilitating Empathic Conversations in Online Mental Health Support: A Reinforcement Learning Approach	Jan 19, 2021	Deep Reinforcement LearningDialogue Generation	CodeCode Available	1
Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes	Jan 19, 2021	Deep Reinforcement LearningPosition	CodeCode Available	1
Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning	Jan 19, 2021	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	1
Deep Reinforcement Learning for Active High Frequency Trading	Jan 18, 2021	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Hierarchical Reinforcement Learning By Discovering Intrinsic Options	Jan 16, 2021	Hierarchical Reinforcement Learningreinforcement-learning	CodeCode Available	1
Controlling the Risk of Conversational Search via Reinforcement Learning	Jan 15, 2021	Conversational Searchreinforcement-learning	CodeCode Available	1
Evaluating Soccer Player: from Live Camera to Deep Reinforcement Learning	Jan 13, 2021	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Memory-Augmented Reinforcement Learning for Image-Goal Navigation	Jan 13, 2021	Data AugmentationNavigate	CodeCode Available	1
Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement Learning	Jan 11, 2021	Language ModelingLanguage Modelling	CodeCode Available	1
Cross-Modal Contrastive Learning of Representations for Navigation using Lightweight, Low-Cost Millimeter Wave Radar for Adverse Environmental Conditions	Jan 10, 2021	Autonomous NavigationContrastive Learning	CodeCode Available	1
Simulating SQL Injection Vulnerability Exploitation Using Q-Learning Reinforcement Learning Agents	Jan 8, 2021	Q-Learningreinforcement-learning	CodeCode Available	1
A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules	Jan 8, 2021	DecoderDeep Reinforcement Learning	CodeCode Available	1
Evolving Reinforcement Learning Algorithms	Jan 8, 2021	Atari GamesMeta-Learning	CodeCode Available	1
The Distracting Control Suite -- A Challenging Benchmark for Reinforcement Learning from Pixels	Jan 7, 2021	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	1
Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning	Jan 7, 2021	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	1
Reinforcement Learning with Latent Flow	Jan 6, 2021	Atari Gamescontinuous-control	CodeCode Available	1
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control	Jan 4, 2021	Deep Reinforcement LearningMeta-Learning	CodeCode Available	1
Multi-Agent Trust Region Learning	Jan 1, 2021	Atari GamesMuJoCo	CodeCode Available	1
Cross-Modal Domain Adaptation for Reinforcement Learning	Jan 1, 2021	Domain AdaptationMuJoCo	CodeCode Available	1
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization	Dec 31, 2020	Collision AvoidanceManagement	CodeCode Available	1
Model-Based Visual Planning with Self-Supervised Functional Distances	Dec 30, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	1
Reinforcement Learning for Control of Valves	Dec 29, 2020	OpenAI Gymreinforcement-learning	CodeCode Available	1
Augmenting Policy Learning with Routines Discovered from a Single Demonstration	Dec 23, 2020	Atari GamesImitation Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 33 of 303Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified