SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1115111175 of 15113 papers

TitleStatusHype
Goal-Space Planning with Subgoal Models0
GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement Learning0
Go-Blend behavior and affect0
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning0
Going Beyond Linear RL: Sample Efficient Neural Function Approximation0
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better0
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL0
Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via Reinforcement Learning0
Government Intervention in Catastrophe Insurance Markets: A Reinforcement Learning Approach0
GraCo -- A Graph Composer for Integrated Circuits0
Gradient-EM Bayesian Meta-learning0
Gradient-Free Neural Network Training via Synaptic-Level Reinforcement Learning0
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction0
Gradient Monitored Reinforcement Learning0
Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning0
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning0
GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization0
Grammar and Gameplay-aligned RL for Game Description Generation with LLMs0
Grammatical Error Correction with Neural Reinforcement Learning0
Granger Causal Interaction Skill Chains0
Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization0
Graph augmented Deep Reinforcement Learning in the GameRLand3D environment0
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network0
Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery0
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning0
Show:102550
← PrevPage 447 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified