SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 84518500 of 15113 papers

TitleStatusHype
Multiple Tasks Integration: Tagging, Syntactic and Semantic Parsing as a Single Task0
Optimization Algorithm for Feedback and Feedforward Policies towards Robot Control Robust to Sensing Failures0
Trajectory Tracking of Underactuated Sea Vessels With Uncertain Dynamics: An Integral Reinforcement Learning Approach0
AdaPool: A Diurnal-Adaptive Fleet Management Framework using Model-Free Deep Reinforcement Learning and Change Point Detection0
Deep Reinforcement Learning for Constrained Field Development Optimization in Subsurface Two-phase Flow0
Generalized Reinforcement Learning for Building Control using Behavioral Cloning0
RLAD: Time Series Anomaly Detection through Reinforcement Learning and Active Learning0
Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning0
Simultaneous Navigation and Construction Benchmarking EnvironmentsCode1
DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation0
Energy Efficient Edge Computing: When Lyapunov Meets Distributed Reinforcement Learning0
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement LearningCode1
FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT0
Towards Real-World Deployment of Reinforcement Learning for Traffic Signal ControlCode1
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity0
Online Policies for Real-Time Control Using MRAC-RL0
Reinforcement learning for optimization of variational quantum circuit architectures0
Deep Reinforcement Learning for Resource Allocation in Business ProcessesCode1
Deep Hedging of Derivatives Using Reinforcement Learning0
Reinforcement Learning Beyond Expectation0
pH-RL: A personalization architecture to bring reinforcement learning to health practice0
Shaping Advice in Deep Multi-Agent Reinforcement LearningCode0
Augmenting Automated Game Testing with Deep Reinforcement Learning0
Joint Resource Management for MC-NOMA: A Deep Reinforcement Learning Approach0
Deep reinforcement learning of event-triggered communication and control for multi-agent cooperative transport0
LASER: Learning a Latent Action Space for Efficient Reinforcement Learning0
Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark0
Robust Reinforcement Learning under model misspecificationCode0
ReAgent: Point Cloud Registration using Imitation and Reinforcement LearningCode1
Self-adaptive Torque Vectoring Controller Using Reinforcement LearningCode0
KnowRU: Knowledge Reusing via Knowledge Distillation in Multi-agent Reinforcement Learning0
Autonomous Overtaking in Gran Turismo Sport Using Curriculum Reinforcement Learning0
Increasing the Efficiency of Policy Learning for Autonomous Vehicles by Multi-Task Representation Learning0
A Convex Programming Approach to Data-Driven Risk-Averse Reinforcement Learning0
Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D UltrasoundCode1
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlowCode1
Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems0
Character Controllers Using Motion VAEsCode1
Model-Free Learning of Safe yet Effective Controllers0
Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots0
MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement LearningCode1
Nearly Horizon-Free Offline Reinforcement Learning0
Reinforcement Learning for Deceiving Reactive Jammers in Wireless Networks0
AoA-Based Pilot Assignment in Massive MIMO Systems Using Deep Reinforcement Learning0
A Meta-Reinforcement Learning Approach to Process Control0
Improving Playtesting Coverage via Curiosity Driven Reinforcement Learning Agents0
Hierarchical Program-Triggered Reinforcement Learning Agents For Automated Driving0
Self-Imitation Learning by Planning0
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement LearningCode1
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning0
Show:102550
← PrevPage 170 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified