SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
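As a rough illustration of the agent, environment, and reward loop described above, here is a minimal sketch using tabular Q-learning. Q-learning is only one of many RL methods and is chosen here for brevity. The five-state chain environment, the `step` and `train` functions, and all hyperparameter values are hypothetical and not taken from any paper listed on this page.

```python
import random

# Hypothetical toy environment: a chain of 5 states (0..4).
# The agent starts in state 0; reaching state 4 ends the episode with reward 1.
N_STATES = 5
ACTIONS = (-1, +1)   # index 0 = move left, index 1 = move right
GAMMA = 0.9          # discount factor for long-term reward (assumed value)


def step(state, action_index):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def choose_action(q_row, epsilon, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < epsilon or q_row[0] == q_row[1]:
        return rng.randrange(2)
    return 0 if q_row[0] > q_row[1] else 1


def train(episodes=500, alpha=0.5, epsilon=0.2, seed=0):
    """Learn action values Q[state][action] from interaction alone."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q[state], epsilon, rng)
            next_state, reward, done = step(state, action)
            # Q-learning target: immediate reward plus discounted best future value.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy policy for the non-terminal states, read off the learned values.
policy = ["right" if row[1] > row[0] else "left" for row in q_table[:-1]]
print(policy)
```

The learned policy is simply the action with the highest estimated long-term value in each state, which is one concrete reading of "maximizing the long-term reward" in this toy setting.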

Papers

Showing 4801–4825 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning | | 0 |
| On Efficient Reinforcement Learning for Full-length Game of StarCraft II | Code | 2 |
| Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning | | 0 |
| Unified Algorithms for RL with Decision-Estimation Coefficients: PAC, Reward-Free, Preference-Based Learning, and Beyond | | 0 |
| Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning | | 0 |
| Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning | Code | 0 |
| Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation | | 0 |
| Reinforcement Learning in Computing and Network Convergence Orchestration | | 0 |
| Computational Discovery of Energy-Efficient Heat Treatment for Microstructure Design using Deep Reinforcement Learning | | 0 |
| Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning | Code | 0 |
| Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments | | 0 |
| An Investigation of the Bias-Variance Tradeoff in Meta-Gradients | Code | 0 |
| Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged Agent | Code | 0 |
| Learning from Symmetry: Meta-Reinforcement Learning with Symmetrical Behaviors and Language Instructions | | 0 |
| ECSAS: Exploring Critical Scenarios from Action Sequence in Autonomous Driving | | 0 |
| Evaluation of Look-ahead Economic Dispatch Using Reinforcement Learning | | 0 |
| LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning | Code | 1 |
| Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games | | 0 |
| Hierarchical Decision Transformer | | 0 |
| Performance Optimization for Variable Bitwidth Federated Learning in Wireless Networks | | 0 |
| Revisiting Discrete Soft Actor-Critic | Code | 1 |
| On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies | | 0 |
| Model-Free Reinforcement Learning for Asset Allocation | | 0 |
| Towards Task-Prioritized Policy Composition | | 0 |
| Optimizing Crop Management with Reinforcement Learning and Imitation Learning | | 0 |
Page 193 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |