SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1260112650 of 15113 papers

TitleStatusHype
The Game of Tetris in Machine LearningCode0
A Survey of Adaptive Resonance Theory Neural Network Models for Engineering Applications0
Face Hallucination by Attentive Sequence Optimization with Reinforcement Learning0
Hierarchical Policy Learning is Sensitive to Goal Space Design0
Deep Residual Reinforcement Learning0
Meta-learners' learning dynamics are unlike learners'0
Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach0
Collaborative Evolutionary Reinforcement LearningCode0
Adaptive Intelligent Secondary Control of Microgrids Using a Biologically-Inspired Reinforcement Learning0
Driving with Style: Inverse Reinforcement Learning in General-Purpose Planning for Automated Driving0
Efficient Model-free Reinforcement Learning in Metric SpacesCode0
Information-Theoretic Considerations in Batch Reinforcement Learning0
Modeling the Long Term Future in Model-Based Reinforcement Learning0
Visceral Machines: Reinforcement Learning with Intrinsic Physiological Rewards0
Sample-efficient policy learning in multi-agent Reinforcement Learning via meta-learning0
Uncovering Surprising Behaviors in Reinforcement Learning via Worst-case Analysis0
Understanding & Generalizing AlphaGo Zero0
Recurrent Experience Replay in Distributed Reinforcement LearningCode0
Towards Consistent Performance on Atari using Expert Demonstrations0
SIMILE: Introducing Sequential Information towards More Effective Imitation Learning0
NEURAL MALWARE CONTROL WITH DEEP REINFORCEMENT LEARNING0
M^3RL: Mind-aware Multi-agent Management Reinforcement Learning0
Explicit Recall for Efficient Exploration0
Learning To Solve Circuit-SAT: An Unsupervised Differentiable Approach0
Learning to Progressively Plan0
Learning to Decompose Compound Questions with Reinforcement Learning0
Learning to Control Visual Abstractions for Structured Exploration in Deep Reinforcement Learning0
ACTRCE: Augmenting Experience via Teacher’s Advice0
Inducing Cooperation via Learning to reshape rewards in semi-cooperative multi-agent reinforcement learning0
Few-Shot Intent Inference via Meta-Inverse Reinforcement Learning0
Learning Goal-Conditioned Value Functions with one-step Path rewards rather than Goal-Rewards0
Deep reinforcement learning with relational inductive biases0
Learning agents with prioritization and parameter noise in continuous state and action space0
Learning Actionable Representations with Goal Conditioned Policies0
A Guider Network for Multi-Dual Learning0
A new dog learns old tricks: RL finds classic optimization algorithms0
Backplay: 'Man muss immer umkehren'0
Automata Guided Skill Composition0
Learning Heuristics for Automated Reasoning through Reinforcement Learning0
Learning to Reinforcement Learn by Imitation0
DHER: Hindsight Experience Replay for Dynamic GoalsCode0
Soft Q-Learning with Mutual-Information Regularization0
SUPERVISED POLICY UPDATECode0
Rating Continuous Actions in Spatial Multi-Agent Problems0
Predicted Variables in Programming0
Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning0
Argus: Smartphone-enabled Human Cooperation via Multi-Agent Reinforcement Learning for Disaster Situational Awareness0
Challenges of Real-World Reinforcement LearningCode1
Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications Outside Coverage0
Deep Neuroevolution of Recurrent and Discrete World ModelsCode0
Show:102550
← PrevPage 253 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified