SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 14761500 of 15113 papers

TitleStatusHype
Forgetful Experience Replay in Hierarchical Reinforcement Learning from DemonstrationsCode1
Lyapunov-Regularized Reinforcement Learning for Power System Transient StabilityCode1
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past ExperienceCode1
Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning ApproachCode1
Deep Reinforcement Learning for Process SynthesisCode1
Barrier Certified Safety Learning Control: When Sum-of-Square Programming Meets Reinforcement LearningCode1
Deep Reinforcement Learning For Sequence to Sequence ModelsCode1
A General Contextualized Rewriting Framework for Text SummarizationCode1
Deep Reinforcement Learning for Real-Time Optimization of Pumps in Water Distribution SystemsCode1
Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement LearningCode1
Deep Reinforcement Learning for Resource Allocation in Business ProcessesCode1
Batch Exploration with Examples for Scalable Robotic Reinforcement LearningCode1
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing ProblemCode1
Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point CloudsCode1
ManiSkill2: A Unified Benchmark for Generalizable Manipulation SkillsCode1
GAEA: Graph Augmentation for Equitable Access via Reinforcement LearningCode1
Deep Reinforcement Learning for Turbulence Modeling in Large Eddy SimulationsCode1
Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly DataCode1
Deep Reinforcement Learning for URLLC data management on top of scheduled eMBB trafficCode1
Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning StudyCode1
A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing ProblemsCode1
Deep Reinforcement Learning from Self-Play in Imperfect-Information GamesCode1
FlapAI Bird: Training an Agent to Play Flappy Bird Using Reinforcement Learning TechniquesCode1
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity RewardsCode1
First return, then exploreCode1
Show:102550
← PrevPage 60 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified