SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 36513675 of 15113 papers

TitleStatusHype
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial StatesCode0
Ranking Policy GradientCode0
Approximate Model-Based Shielding for Safe Reinforcement LearningCode0
General policy mapping: online continual reinforcement learning inspired on the insect brainCode0
Approximately Optimal Search on a Higher-dimensional Sliding PuzzleCode0
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement LearningCode0
CODEX: A Cluster-Based Method for Explainable Reinforcement LearningCode0
Generalized Phase Pressure Control Enhanced Reinforcement Learning for Traffic Signal ControlCode0
Generalized Speedy Q-learningCode0
Generative Planning for Temporally Coordinated Exploration in Reinforcement LearningCode0
A Lyapunov-based Approach to Safe Reinforcement LearningCode0
Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and DefensesCode0
Active Advantage-Aligned Online Reinforcement Learning with Offline DataCode0
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task LearningCode0
Generalization in Text-based Games via Hierarchical Reinforcement LearningCode0
COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven ExplorationCode0
Generalization in Visual Reinforcement Learning with the Reward Sequence DistributionCode0
Deep Reinforcement Learning for Multi-Domain Dialogue SystemsCode0
Generalization in Reinforcement Learning with Selective Noise Injection and Information BottleneckCode0
Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous FlightCode0
Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across DomainsCode0
CoaCor: Code Annotation for Code Retrieval with Reinforcement LearningCode0
Autonomous Management of Energy-Harvesting IoT Nodes Using Deep Reinforcement LearningCode0
ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement LearningCode0
Attention-Based Reward Shaping for Sparse and Delayed RewardsCode0
Show:102550
← PrevPage 147 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified