SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 80768100 of 15113 papers

TitleStatusHype
Average-Reward Reinforcement Learning with Trust Region Methods0
Explainable Artificial Intelligence (XAI) for Increasing User Trust in Deep Reinforcement Learning Driven Autonomous Systems0
Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning0
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement LearningCode1
DisTop: Discovering a Topological representation to learn diverse and rewarding skills0
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning0
Efficient Continuous Control with Double Actors and Regularized CriticsCode1
Distributional Reinforcement Learning with Unconstrained Monotonic Neural NetworksCode1
3D UAV Trajectory and Data Collection Optimisation via Deep Reinforcement Learning0
Control-Oriented Model-Based Reinforcement Learning with Implicit DifferentiationCode1
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learningCode1
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement LearningCode1
Learning Routines for Effective Off-Policy Reinforcement Learning0
Heuristic-Guided Reinforcement Learning0
Same State, Different Task: Continual Reinforcement Learning without InterferenceCode1
Reinforcement Learning for Assignment Problem with Time Constraints0
Resource Allocation in Disaggregated Data Centre Systems with Reinforcement Learning0
Online reinforcement learning with sparse rewards through an active inference capsuleCode1
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement LearningCode2
Differentiable Architecture Search for Reinforcement LearningCode1
Robustifying Reinforcement Learning Policies with L_1 Adaptive Control0
Detecting and Adapting to Novelty in Games0
Be Considerate: Objectives, Side Effects, and Deciding How to Act0
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RLCode0
Celebrating Diversity in Shared Multi-Agent Reinforcement LearningCode1
Show:102550
← PrevPage 324 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified