SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
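As a rough illustration of the agent-environment loop described above, here is a minimal sketch using tabular Q-learning, one of many RL methods. The `Corridor` environment, its reward values, and all hyperparameters are hypothetical choices made for this example; they do not come from any paper listed on this page.

```python
import random


def discounted_return(rewards, gamma=0.99):
    """Cumulative discounted reward: sum over t of gamma**t * rewards[t]."""
    total = 0.0
    for r in reversed(rewards):
        total = r + gamma * total
    return total


class Corridor:
    """Hypothetical toy environment: states 0..n-1, start at 0, goal at n-1."""

    def __init__(self, n=5):
        self.n = n
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 0 moves left, action 1 moves right; walls clamp the position.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.n - 1)
        done = self.state == self.n - 1
        # Assumed reward scheme: +1 at the goal, small penalty on other steps.
        reward = 1.0 if done else -0.01
        return self.state, reward, done


def q_learning(env, episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1,
               max_steps=100, seed=0):
    """Learn a table of action values by interacting with the environment."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n)]
    for _ in range(episodes):
        s = env.reset()
        for _ in range(max_steps):
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            s2, r, done = env.step(a)
            # Move the estimate toward reward plus discounted future value.
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
            if done:
                break
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each state (ties go to action 0)."""
    return [max(range(2), key=lambda a: row[a]) for row in q]


if __name__ == "__main__":
    q = q_learning(Corridor(n=5))
    print(greedy_policy(q))
```

In this sketch the learned policy is read off the value table by taking the best action in each state. The entry for the goal state is never updated, so only the non-terminal states carry meaning.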

Papers

Showing 7626–7650 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce | | 0 |
| Genetic Algorithm enhanced by Deep Reinforcement Learning in parent selection mechanism and mutation : Minimizing makespan in permutation flow shop scheduling problems | | 0 |
| Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies | | 0 |
| Genetic-Gated Networks for Deep Reinforcement | | 0 |
| Genetic-Gated Networks for Deep Reinforcement Learning | | 0 |
| Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems | | 0 |
| Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning | | 0 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning | | 0 |
| GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers | | 0 |
| Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction | | 0 |
| Geometrically Coupled Monte Carlo Sampling | | 0 |
| Geometric Entropic Exploration | | 0 |
| Geometric Multi-Model Fitting by Deep Reinforcement Learning | | 0 |
| Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning | | 0 |
| Getting By Goal Misgeneralization With a Little Help From a Mentor | | 0 |
| GFlowNet Fine-tuning for Diverse Correct Solutions in Mathematical Reasoning Tasks | | 0 |
| GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | | 0 |
| GitFL: Adaptive Asynchronous Federated Learning using Version Control | | 0 |
| GitGraph - Architecture Search Space Creation through Frequent Computational Subgraph Mining | | 0 |
| GITSR: Graph Interaction Transformer-based Scene Representation for Multi Vehicle Collaborative Decision-making | | 0 |
| A Simulation Environment and Reinforcement Learning Method for Waste Reduction | | 0 |
| G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning | | 0 |
| GLiDE: Generalizable Quadrupedal Locomotion in Diverse Environments with a Centroidal Model | | 0 |
| GLIDE-RL: Grounded Language Instruction through DEmonstration in RL | | 0 |
| Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |