SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1165111700 of 15113 papers

TitleStatusHype
Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States0
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention0
Residential Demand Response Applications Using Batch Reinforcement Learning0
Residual Feedback Learning for Contact-Rich Manipulation Tasks with Uncertainty0
Residual Learning from Demonstration: Adapting DMPs for Contact-rich Manipulation0
Residual Policy Learning for Powertrain Control0
Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning0
Residual Reinforcement Learning for Robot Control0
Residual Reinforcement Learning from Demonstrations0
Resilient Autonomous Control of Distributed Multi-agent Systems in Contested Environments0
Resilient Computing with Reinforcement Learning on a Dynamical System: Case Study in Sorting0
Resilient Constrained Reinforcement Learning0
Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations0
Resilient Legged Local Navigation: Learning to Traverse with Compromised Perception End-to-End0
Resilient robot teams: a review integrating decentralised control, change-detection, and learning0
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning0
Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning0
Resolving Causal Confusion in Reinforcement Learning via Robust Exploration0
Resolving Congestions in the Air Traffic Management Domain via Multiagent Reinforcement Learning Methods0
Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning0
Resource Abstraction for Reinforcement Learning in Multiagent Congestion Problems0
Resource Allocation for a Wireless Coexistence Management System Based on Reinforcement Learning0
Resource Allocation in Mobility-Aware Federated Learning Networks: A Deep Reinforcement Learning Approach0
Resource Allocation in Multicore Elastic Optical Networks: A Deep Reinforcement Learning Approach0
The state-of-the-art review on resource allocation problem using artificial intelligence methods on various computing paradigms0
Resource Constrained Deep Reinforcement Learning0
Resource-Constrained Station-Keeping for Helium Balloons using Reinforcement Learning0
Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning0
Resource Management for Blockchain-enabled Federated Learning: A Deep Reinforcement Learning Approach0
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning0
Resource Optimization for Tail-Based Control in Wireless Networked Control Systems0
Responding to Illegal Activities Along the Canadian Coastlines Using Reinforcement Learning0
Response to Comment on 'Perceptual Learning Incepted by Decoded fMRI Neurofeedback Without Stimulus Presentation'; How can a decoded neurofeedback method (DecNef) lead to successful reinforcement and visual perceptual learning?0
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods0
Responsive Safety in Reinforcement Learning0
Restarted Bayesian Online Change-point Detection for Non-Stationary Markov Decision Processes0
Restoring Chaos Using Deep Reinforcement Learning0
Rethink AI-based Power Grid Control: Diving Into Algorithm Design0
Rethinking Decision Transformer via Hierarchical Reinforcement Learning0
A Micro-Objective Perspective of Reinforcement Learning0
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning0
Rethinking Exposure Bias In Language Modeling0
Rethinking Modern Communication from Semantic Coding to Semantic Communication0
Rethinking Pareto Approaches in Constrained Reinforcement Learning0
Rethinking Population-assisted Off-policy Reinforcement Learning0
Rethinking Pruning for Backdoor Mitigation: An Optimization Perspective0
Rethinking Reinforcement Learning based Logic Synthesis0
Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective0
Rethinking Robustness Assessment: Adversarial Attacks on Learning-based Quadrupedal Locomotion Controllers0
Rethinking State Disentanglement in Causal Reinforcement Learning0
Show:102550
← PrevPage 234 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified