SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 37013725 of 15113 papers

TitleStatusHype
Sim-and-Real Reinforcement Learning for Manipulation: A Consensus-based Approach0
Limited Query Graph Connectivity Test0
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors0
Exponential Hardness of Reinforcement Learning with Linear Function Approximation0
On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process0
Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning0
The Dormant Neuron Phenomenon in Deep Reinforcement LearningCode6
Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs0
GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual ExplanationsCode1
GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification0
Neural Laplace Control for Continuous-time Delayed SystemsCode1
Model-Based Uncertainty in Value FunctionsCode1
EvoTorch: Scalable Evolutionary Computation in PythonCode3
Multi-Agent Reinforcement Learning with Common Policy for Antenna Tilt Optimization0
Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains0
AC2C: Adaptively Controlled Two-Hop Communication for Multi-Agent Reinforcement Learning0
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationCode0
Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement LearningCode1
Diverse Policy Optimization for Structured Action SpaceCode1
To the Noise and Back: Diffusion for Shared Autonomy0
Concept Learning for Interpretable Multi-Agent Reinforcement Learning0
Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMsCode1
Behavior Proximal Policy OptimizationCode1
Towards Decentralized Predictive Quality of Service in Next-Generation Vehicular Networks0
Self-supervised network distillation: an effective approach to exploration in sparse reward environmentsCode0
Show:102550
← PrevPage 149 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified