SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 16511675 of 15113 papers

TitleStatusHype
Offline Reinforcement Learning from Images with Latent Space ModelsCode1
Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle Routing ProblemsCode1
Deep Reinforcement Learning for Joint Spectrum and Power Allocation in Cellular NetworksCode1
Generalize a Small Pre-trained Model to Arbitrarily Large TSP InstancesCode1
CityLearn: Standardizing Research in Multi-Agent Reinforcement Learning for Demand Response and Urban Energy ManagementCode1
Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting AgentCode1
High-Throughput Synchronous Deep RLCode1
Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement LearningCode1
Policy Gradient RL Algorithms as Directed Acyclic GraphsCode1
Reinforcement Learning for Contact-Rich Tasks: Robotic Peg Insertion StrategiesCode1
Sim-to-real reinforcement learning applied to end-to-end vehicle controlCode1
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy SearchCode1
Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman ProblemCode1
Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement LearningCode1
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human EnvironmentsCode1
GAEA: Graph Augmentation for Equitable Access via Reinforcement LearningCode1
Reset-Free Lifelong Learning with Skill-Space PlanningCode1
RLOC: Terrain-Aware Legged Locomotion using Reinforcement Learning and Optimal ControlCode1
ACN-Sim: An Open-Source Simulator for Data-Driven Electric Vehicle Charging ResearchCode1
Revisiting Maximum Entropy Inverse Reinforcement Learning: New Perspectives and AlgorithmsCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Learning Multi-Agent Communication through Structured Attentive ReasoningCode1
Self-supervised Visual Reinforcement Learning with Object-centric RepresentationsCode1
Interactive Machine Learning of Musical GestureCode1
Generalization in Reinforcement Learning by Soft Data AugmentationCode1
Show:102550
← PrevPage 67 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified