SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 10011050 of 15113 papers

TitleStatusHype
Aerial View Localization with Reinforcement Learning: Towards Emulating Search-and-RescueCode1
Hearts Gym: Learning Reinforcement Learning as a Team EventCode1
Actor Prioritized Experience ReplayCode1
Cell-Free Latent Go-ExploreCode1
Style-Agnostic Reinforcement LearningCode1
Rethinking Conversational Recommendations: Is Decision Tree All You Need?Code1
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement LearningCode1
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement LearningCode1
Light-weight probing of unsupervised representations for Reinforcement LearningCode1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement LearningCode1
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning AlgorithmCode1
Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraftCode1
Towards Sequence-Level Training for Visual TrackingCode1
Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement LearningCode1
A Modular Framework for Reinforcement Learning Optimal ExecutionCode1
Robust Reinforcement Learning using Offline DataCode1
Object Detection with Deep Reinforcement LearningCode1
Automating DBSCAN via Deep Reinforcement LearningCode1
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past ExperienceCode1
From Scratch to Sketch: Deep Decoupled Hierarchical Reinforcement Learning for Robotic Sketching AgentCode1
Mobility-Aware Cooperative Caching in Vehicular Edge Computing Based on Asynchronous Federated and Deep Reinforcement LearningCode1
Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Tasks with Sparse RewardsCode1
Performance Comparison of Deep RL Algorithms for Energy Systems Optimal SchedulingCode1
Model-based graph reinforcement learning for inductive traffic signal controlCode1
Unified Automatic Control of Vehicular Systems with Reinforcement LearningCode1
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement LearningCode1
Lifelong Machine Learning of Functionally Compositional StructuresCode1
Learning Soccer Juggling Skills with Layer-wise Mixture-of-ExpertsCode1
Hierarchical Kickstarting for Skill Transfer in Reinforcement LearningCode1
Driver Dojo: A Benchmark for Generalizable Reinforcement Learning for Autonomous DrivingCode1
Robust Knowledge Adaptation for Dynamic Graph Neural NetworksCode1
Reinforcement learning for Energies of the future and carbon neutrality: a Challenge DesignCode1
Discriminator-Weighted Offline Imitation Learning from Suboptimal DemonstrationsCode1
Deep Reinforcement Learning for Market Making Under a Hawkes Process-Based Limit Order Book ModelCode1
Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal ReasoningCode1
Bayesian Generational Population-Based TrainingCode1
A Meta-Reinforcement Learning Algorithm for Causal DiscoveryCode1
Active Exploration for Inverse Reinforcement LearningCode1
Asset Allocation: From Markowitz to Deep Reinforcement LearningCode1
A General Contextualized Rewriting Framework for Text SummarizationCode1
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement LearningCode1
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy OptimizationCode1
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement LearningCode1
Reinforced Lin-Kernighan-Helsgaun Algorithms for the Traveling Salesman ProblemsCode1
Interaction Pattern Disentangling for Multi-Agent Reinforcement LearningCode1
CompoSuite: A Compositional Reinforcement Learning BenchmarkCode1
Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse ManagementCode1
A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative ObjectCode1
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICsCode1
Show:102550
← PrevPage 21 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified