SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
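As a rough illustration of the loop described above (act, observe a reward, update the decision rule), here is a minimal sketch using tabular Q-learning. The corridor environment, the `step`, `train`, and `greedy_policy` names, and all hyperparameters are assumptions made up for this example. They are not taken from any paper listed on this page.

```python
import random

# Hypothetical toy environment: a corridor of 5 cells.
# The agent starts in cell 0 and gets reward 1 on reaching the last cell.
N_STATES = 5
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                # Exploit, breaking ties at random so early episodes still move.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state according to the Q-table."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this sketch the discount factor `gamma` is what makes the agent value long-term reward: cells farther from the goal end up with smaller, but still positive, values for moving right.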

Papers

Showing 10351-10400 of 15113 papers

Title | Status | Hype
Accelerated Deep Reinforcement Learning Based Load Shedding for Emergency Voltage Control | - | 0
Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning with Application to Autonomous Driving | - | 0
dm_control: Software and Tasks for Continuous Control | - | 0
Graph Neural Networks and Reinforcement Learning for Behavior Generation in Semantic Environments | Code | 1
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret | - | 0
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs | - | 0
QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning | - | 0
Safe Reinforcement Learning via Curriculum Induction | Code | 1
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data | - | 0
Near-Optimal Reinforcement Learning with Self-Play | - | 0
Learning with AMIGo: Adversarially Motivated Intrinsic Goals | Code | 1
Ecological Reinforcement Learning | - | 0
Constrained Combinatorial Optimization with Reinforcement Learning | - | 0
Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis | - | 0
Reinforcement Learning for Mean Field Games with Strategic Complementarities | - | 0
Gradient-EM Bayesian Meta-learning | - | 0
Automated Optical Multi-layer Design via Deep Reinforcement Learning | Code | 0
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning | - | 0
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning | Code | 1
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation | - | 0
Towards Tractable Optimism in Model-Based Reinforcement Learning | - | 0
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees | - | 0
Entropic Risk Constrained Soft-Robust Policy Optimization | - | 0
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies | - | 0
Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms | - | 0
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning | Code | 1
Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Code | 1
FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer | - | 0
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes | Code | 1
On Reward-Free Reinforcement Learning with Linear Function Approximation | - | 0
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration | - | 0
A Reinforcement Learning Approach for Transient Control of Liquid Rocket Engines | - | 0
Learn to Earn: Enabling Coordination within a Ride Hailing Fleet | - | 0
WD3: Taming the Estimation Bias in Deep Reinforcement Learning | - | 0
Provably adaptive reinforcement learning in metric spaces | - | 0
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Code | 1
Cooperative Multi-Agent Reinforcement Learning with Partial Observations | - | 0
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning | Code | 0
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs | - | 0
Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning | - | 0
Deep Reinforcement Learning amidst Lifelong Non-Stationarity | - | 0
Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning | - | 0
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning | Code | 1
Learning Invariant Representations for Reinforcement Learning without Reconstruction | Code | 1
Converting Biomechanical Models from OpenSim to MuJoCo | Code | 1
Eco-Vehicular Edge Networks for Connected Transportation: A Distributed Multi-Agent Reinforcement Learning Approach | - | 0
Green Simulation Assisted Reinforcement Learning with Model Risk for Biomanufacturing Learning and Control | Code | 0
Introduction to Machine Learning for Accelerator Physics | - | 0
Learning to Track Dynamic Targets in Partially Known Environments | Code | 1
Deep Reinforcement Learning Controller for 3D Path-following and Collision Avoidance by Autonomous Underwater Vehicles | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified