SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1102611050 of 15113 papers

TitleStatusHype
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments0
FuzzerGym: A Competitive Framework for Fuzzing and Learning0
FuzzTheREST: An Intelligent Automated Black-box RESTful API Fuzzer0
Fuzzy Controller of Reward of Reinforcement Learning For Handwritten Digit Recognition0
FuzzyLight: A Robust Two-Stage Fuzzy Approach for Traffic Signal Control Works in Real Cities0
G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning0
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL0
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis0
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations0
GAN-powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing0
GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback0
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning0
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning0
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization0
Gaussian Process Policy Optimization0
GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph Neural Networks and Reinforcement Learning0
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning0
GDI: Rethinking What Makes Reinforcement Learning Different from Supervised Learning0
Gegelati: Lightweight Artificial Intelligence through Generic and Evolvable Tangled Program Graphs0
Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning0
GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning0
General Intelligence Requires Rethinking Exploration0
Generalisation in Lifelong Reinforcement Learning through Logical Composition0
Generalised Policy Improvement with Geometric Policy Composition0
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data0
Show:102550
← PrevPage 442 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified