SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 76517700 of 15113 papers

TitleStatusHype
Global Convergence of the ODE Limit for Online Actor-Critic Algorithms in Reinforcement Learning0
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods0
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion0
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning0
Goal-conditioned Imitation Learning0
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning0
Goal-Conditioned Reinforcement Learning in the Presence of an Adversary0
Goal-Conditioned Reinforcement Learning with Imagined Subgoals0
Goal-directed Generation of Discrete Structures with Conditional Generative Models0
Goal-Directed Planning by Reinforcement Learning and Active Inference0
Goal-Directed Story Generation: Augmenting Generative Language Models with Reinforcement Learning0
Goal-Driven Sequential Data Abstraction0
Goal-oriented Dialogue Policy Learning from Failures0
Goal-Oriented Next Best Activity Recommendation using Reinforcement Learning0
Goal-oriented Trajectories for Efficient Exploration0
Goal-Oriented Visual Question Generation via Intermediate Rewards0
Goal-Space Planning with Subgoal Models0
GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement Learning0
Go-Blend behavior and affect0
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning0
Going Beyond Linear RL: Sample Efficient Neural Function Approximation0
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better0
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL0
Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via Reinforcement Learning0
Government Intervention in Catastrophe Insurance Markets: A Reinforcement Learning Approach0
GraCo -- A Graph Composer for Integrated Circuits0
Gradient-EM Bayesian Meta-learning0
Gradient-Free Neural Network Training via Synaptic-Level Reinforcement Learning0
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction0
Gradient Monitored Reinforcement Learning0
Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning0
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning0
GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization0
Grammar and Gameplay-aligned RL for Game Description Generation with LLMs0
Grammatical Error Correction with Neural Reinforcement Learning0
Granger Causal Interaction Skill Chains0
Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization0
Graph augmented Deep Reinforcement Learning in the GameRLand3D environment0
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network0
Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery0
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning0
Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics0
Graph Convolutional Reinforcement Learning for Collaborative Queuing Agents0
Graph Decision Transformer0
Graph-enabled Reinforcement Learning for Time Series Forecasting with Adaptive Intelligence0
Graph-Enhanced Exploration for Goal-oriented Reinforcement Learning0
Graph neural induction of value iteration0
Graph Neural Network based Agent in Google Research Football0
Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations0
Graph Neural Networks for Relational Inductive Bias in Vision-based Deep Reinforcement Learning of Robot Control0
Show:102550
← PrevPage 154 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified