SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal of reinforcement learning is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
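The interaction loop described above can be illustrated with a minimal sketch. It assumes a hypothetical five-state corridor environment and uses tabular Q-learning as one possible learning rule. The environment, the function names (`step`, `choose_action`, `train`, `greedy_policy`), and all hyperparameters are illustrative choices, not taken from any paper listed on this page.

```python
import random

N_STATES = 5          # hypothetical corridor: states 0..4, state 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right
GOAL = N_STATES - 1


def step(state, action):
    """Toy environment: return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == GOAL
    return next_state, (1.0 if done else 0.0), done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy action choice with random tie-breaking."""
    left, right = q[state]
    if rng.random() < epsilon or left == right:
        return rng.choice(ACTIONS)
    return 0 if left > right else 1


def train(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Run tabular Q-learning and return the learned Q-table."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Target: immediate reward plus discounted best future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action per non-terminal state under the learned Q-table."""
    return [0 if left > right else 1 for left, right in q[:GOAL]]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

In this toy setting the only reward sits at the right end of the corridor, so the learned greedy policy should choose "move right" (action 1) in every non-terminal state. Real environments typically need function approximation and more careful exploration than this sketch uses.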

Papers

Showing 6051–6075 of 15113 papers

Title | Status | Hype
Investigation of Factorized Optical Flows as Mid-Level Representations |  | 0
Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks |  | 0
Multi-Agent Broad Reinforcement Learning for Intelligent Traffic Light Control |  | 0
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning |  | 0
A Complete Characterization of Linear Estimators for Offline Policy Evaluation |  | 0
Curriculum-based Reinforcement Learning for Distribution System Critical Load Restoration | Code | 1
Designing Heterogeneous GNNs with Desired Permutation Properties for Wireless Resource Allocation |  | 0
Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping |  | 0
Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery |  | 0
Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models |  | 0
Robot Learning of Mobile Manipulation with Reachability Behavior Priors |  | 0
A Survey on Reinforcement Learning Methods in Character Animation |  | 0
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets |  | 0
Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation |  | 0
Deep Reinforcement Learning for Entity Alignment | Code | 1
Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations |  | 0
Influencing Long-Term Behavior in Multiagent Reinforcement Learning | Code | 1
Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network |  | 0
Knowledge Transfer in Deep Reinforcement Learning for Slice-Aware Mobility Robustness Optimization |  | 0
Cascaded Gaps: Towards Gap-Dependent Regret for Risk-Sensitive Reinforcement Learning |  | 0
Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility |  | 0
Reliably Re-Acting to Partner's Actions with the Social Intrinsic Motivation of Transfer Empowerment | Code | 1
Reinforcement Learning for Location-Aware Scheduling |  | 0
On Credit Assignment in Hierarchical Reinforcement Learning | Code | 0
Black-Box Safety Validation of Autonomous Systems: A Multi-Fidelity Reinforcement Learning Approach |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 |  | Unverified
2 | PPO | Mean Normalized Performance | 0.58 |  | Unverified