SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 13761400 of 15113 papers

TitleStatusHype
A Reinforcement Learning Environment for Mathematical Reasoning via Program SynthesisCode1
Forgetful Experience Replay in Hierarchical Reinforcement Learning from DemonstrationsCode1
A Reinforcement Learning Environment For Job-Shop SchedulingCode1
Computational Performance of Deep Reinforcement Learning to find Nash EquilibriaCode1
Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point CloudsCode1
From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement LearningCode1
From Scratch to Sketch: Deep Decoupled Hierarchical Reinforcement Learning for Robotic Sketching AgentCode1
Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman ProblemCode1
Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning FrameworkCode1
Future-conditioned Unsupervised Pretraining for Decision TransformerCode1
Combining Reinforcement Learning and Constraint Programming for Combinatorial OptimizationCode1
Gamma and Vega Hedging Using Deep Distributional Reinforcement LearningCode1
GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual ExplanationsCode1
Aerial View Localization with Reinforcement Learning: Towards Emulating Search-and-RescueCode1
A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehiclesCode1
Sample Efficient Reinforcement Learning via Large Vision Language Model DistillationCode1
Generalizable Visual Reinforcement Learning with Segment Anything ModelCode1
Generalization in Reinforcement Learning by Soft Data AugmentationCode1
A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement LearningCode1
Combining Reinforcement Learning with Model Predictive Control for On-Ramp MergingCode1
Learning to combine primitive skills: A step towards versatile robotic manipulationCode1
Combining Deep Reinforcement Learning and Search for Imperfect-Information GamesCode1
Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal ReasoningCode1
Combining Modular Skills in Multitask LearningCode1
Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMsCode1
Show:102550
← PrevPage 56 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified