SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1100111050 of 15113 papers

TitleStatusHype
AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement LearningCode1
Learning Force Control for Contact-rich Manipulation Tasks with Rigid Position-controlled Robots0
Fully Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks0
A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement LearningCode0
Learning When and Where to Zoom with Deep Reinforcement LearningCode1
Learning Near Optimal Policies with Low Inherent Bellman Error0
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts0
TAdam: A Robust Stochastic Gradient OptimizerCode0
Mixed Reinforcement Learning with Additive Stochastic Uncertainty0
A Self-Tuning Actor-Critic Algorithm0
On Catastrophic Interference in Atari 2600 GamesCode0
Deep Reinforcement Learning for FlipIt Security Game0
Reinforcement Learning through Active Inference0
Deep Reinforcement Learning Based Intelligent Reflecting Surface for Secure Wireless Communications0
Towards Modular Algorithm Induction0
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games0
Learning in Markov Decision Processes under Constraints0
A Visual Communication Map for Multi-Agent Deep Reinforcement Learning0
Autonomous robotic nanofabrication with reinforcement learningCode0
Assembly robots with optimized control stiffness through reinforcement learning0
Acceleration of Actor-Critic Deep Reinforcement Learning for Visual Grasping in Clutter by State Representation Learning Based on Disentanglement of a Raw Input Image0
Analysis of diversity-accuracy tradeoff in image captioningCode1
Cautious Reinforcement Learning via Distributional Risk in the Dual Domain0
Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework0
Training Adversarial Agents to Exploit Weaknesses in Deep Control PoliciesCode0
Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning0
Reinforcement Learning of Risk-Constrained Policies in Markov Decision ProcessesCode0
Neural Ordinary Differential Equation Value Networks for Parametrized Action Spaces0
Optimistic Exploration even with a Pessimistic InitialisationCode1
Using Reinforcement Learning in the Algorithmic Trading ProblemCode1
Cautious Reinforcement Learning with Logical Constraints0
Efficient reinforcement learning control for continuum robots based on Inexplicit Prior KnowledgeCode0
Generalized Hindsight for Reinforcement Learning0
Mid-flight Propeller Failure Detection and Control of Propeller-deficient Quadcopter using Reinforcement LearningCode0
When Do Drivers Concentrate? Attention-based Driver Behavior Modeling With Deep Reinforcement Learning0
Scalable Multi-Task Imitation Learning with Autonomous Improvement0
Whole-Body Control of a Mobile Manipulator using End-to-End Reinforcement LearningCode1
Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization0
On Reinforcement Learning for Turn-based Zero-sum Markov Games0
Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements0
Rewriting History with Inverse RL: Hindsight Inference for Policy ImprovementCode1
Off-Policy Deep Reinforcement Learning with Analogous Disentangled ExplorationCode0
Reward Shaping for Human Learning via Inverse Reinforcement LearningCode0
G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning0
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity0
Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approachCode0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning0
Reconfigurable Intelligent Surface Assisted Multiuser MISO Systems Exploiting Deep Reinforcement LearningCode1
Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic0
Wireless 2.0: Towards an Intelligent Radio Environment Empowered by Reconfigurable Meta-Surfaces and Artificial Intelligence0
Show:102550
← PrevPage 221 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified