SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 67016725 of 15113 papers

TitleStatusHype
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration0
RMIX: Learning Risk-Sensitive Policies forCooperative Reinforcement Learning Agents0
Weighted model estimation for offline model-based reinforcement learning0
Regularized Softmax Deep Multi-Agent Q-LearningCode1
Offline Model-based Adaptable Policy LearningCode1
BooVI: Provably Efficient Bootstrapped Value Iteration0
CO-PILOT: COllaborative Planning and reInforcement Learning On sub-Task curriculumCode0
Learning Tree Interpretation from Object Representation for Deep Reinforcement Learning0
Curriculum Offline Imitating Learning0
BCORLE(): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce MarketCode1
Counterexample Guided RL Policy Refinement Using Bayesian OptimizationCode0
Distributionally Robust Imitation Learning0
Explicable Reward Design for Reinforcement Learning AgentsCode0
EDGE: Explaining Deep Reinforcement Learning PoliciesCode1
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement LearningCode1
Energy-Efficient Design for a NOMA assisted STAR-RIS Network with Deep Reinforcement Learning0
Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning0
Model-Free μ Synthesis via Adversarial Reinforcement Learning0
The Power of Communication in a Distributed Multi-Agent System0
MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic Engineering0
Continuous Control With Ensemble Deep Deterministic Policy GradientsCode0
DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks0
Multi-UAV Conflict Resolution with Graph Convolutional Reinforcement Learning0
Reinforcement Learning Algorithm for Traffic Steering in Heterogeneous Network0
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement LearningCode0
Show:102550
← PrevPage 269 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified