SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1075110800 of 15113 papers

TitleStatusHype
Control-Aware Representations for Model-based Reinforcement Learning0
Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments0
Batch-Constrained Reinforcement Learning for Dynamic Distribution Network Reconfiguration0
Environment Shaping in Reinforcement Learning using State Abstraction0
Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty0
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping0
Online Multi-agent Reinforcement Learning for Decentralized Inverter-based Volt-VAR Control0
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning0
Show me the Way: Intrinsic Motivation from Demonstrations0
On the Relationship Between Active Inference and Control as Inference0
Near-Optimal Reinforcement Learning with Self-Play0
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data0
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs0
QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning0
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret0
Constrained Combinatorial Optimization with Reinforcement Learning0
Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning with Application to Autonomous Driving0
dm_control: Software and Tasks for Continuous Control0
Ecological Reinforcement Learning0
Accelerated Deep Reinforcement Learning Based Load Shedding for Emergency Voltage Control0
Automated Optical Multi-layer Design via Deep Reinforcement LearningCode0
Reinforcement Learning for Mean Field Games with Strategic Complementarities0
Gradient-EM Bayesian Meta-learning0
Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis0
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning0
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation0
Towards Tractable Optimism in Model-Based Reinforcement Learning0
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees0
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies0
Entropic Risk Constrained Soft-Robust Policy Optimization0
Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms0
Learn to Earn: Enabling Coordination within a Ride Hailing Fleet0
A Reinforcement Learning Approach for Transient Control of Liquid Rocket Engines0
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration0
FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimize0
On Reward-Free Reinforcement Learning with Linear Function Approximation0
Provably adaptive reinforcement learning in metric spaces0
WD3: Taming the Estimation Bias in Deep Reinforcement Learning0
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs0
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement LearningCode0
Cooperative Multi-Agent Reinforcement Learning with Partial Observations0
Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning0
Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning0
Deep Reinforcement Learning amidst Lifelong Non-Stationarity0
Deep Reinforcement Learning Controller for 3D Path-following and Collision Avoidance by Autonomous Underwater Vehicles0
Delta Schema Network in Model-based Reinforcement LearningCode0
Eco-Vehicular Edge Networks for Connected Transportation: A Distributed Multi-Agent Reinforcement Learning Approach0
Introduction to Machine Learning for Accelerator Physics0
Green Simulation Assisted Reinforcement Learning with Model Risk for Biomanufacturing Learning and ControlCode0
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework0
Show:102550
← PrevPage 216 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified