SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1215112200 of 15113 papers

TitleStatusHype
PrecoderNet: Hybrid Beamforming for Millimeter Wave Systems with Deep Reinforcement Learning0
Inverse Reinforcement Learning with Multiple Ranked Experts0
Control of nonlinear, complex and black-boxed greenhouse system with reinforcement learningCode0
Multi-Agent Adversarial Inverse Reinforcement LearningCode0
Wasserstein Robust Reinforcement Learning0
Model-Free Unsupervised Learning for Optimization Problems with Constraints0
Reward Learning for Efficient Reinforcement Learning in Extractive Document SummarisationCode0
DeepPlace: Learning to Place Applications in Multi-Tenant Clusters0
MineRL: A Large-Scale Dataset of Minecraft DemonstrationsCode0
Hindsight Trust Region Policy OptimizationCode0
Goal-Driven Sequential Data Abstraction0
Semantic RL with Action Grammars: Data-Efficient Learning of Hierarchical Task AbstractionsCode0
Taxable Stock Trading with Deep Reinforcement Learning0
Towards Model-based Reinforcement Learning for Industry-near EnvironmentsCode0
On Hard Exploration for Reinforcement Learning: a Case Study in Pommerman0
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment0
Deep Reinforcement Learning for Personalized Search Story Recommendation0
Environment Probing Interaction PoliciesCode0
Large scale continuous-time mean-variance portfolio allocation via reinforcement learning0
Action Guidance with MCTS for Deep Reinforcement Learning0
Interactive Lungs Auscultation with Reinforcement Learning Agent0
Google Research Football: A Novel Reinforcement Learning EnvironmentCode0
Dynamic Input for Deep Reinforcement Learning in Autonomous Driving0
AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks0
Fairness in Reinforcement Learning0
Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic Experts0
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning0
Metalearned Neural MemoryCode0
Structured Fusion Networks for DialogCode0
Modeling question asking using neural program generationCode0
Discourse Marker Augmented Network with Reinforcement Learning for Natural Language InferenceCode0
Deep Reinforcement Learning for Clinical Decision Support: A Brief Survey0
Agent Modeling as Auxiliary Task for Deep Reinforcement Learning0
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges0
Efficient Policy Learning for Non-Stationary MDPs under Adversarial Manipulation0
Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning0
VRLS: A Unified Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications0
Characterizing Attacks on Deep Reinforcement LearningCode0
Techniques for Automated Machine Learning0
Arena: a toolkit for Multi-Agent Reinforcement LearningCode0
An Actor-Critic-Attention Mechanism for Deep Reinforcement Learning in Multi-view Environments0
Accelerating Reinforcement Learning through GPU Atari EmulationCode0
Delegative Reinforcement Learning: learning to avoid traps with a little help0
Combinatorial Keyword Recommendations for Sponsored Search with Deep Reinforcement Learning0
Self-Attentional Credit Assignment for Transfer in Reinforcement LearningCode0
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery0
Convolutional Reservoir Computing for World ModelsCode0
Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration0
Zermelo's problem: Optimal point-to-point navigation in 2D turbulent flows using Reinforcement Learning0
Photonic architecture for reinforcement learning0
Show:102550
← PrevPage 244 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified