SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
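
The interaction loop described above can be sketched with tabular Q-learning on a toy problem. This is only an illustrative sketch, not a method taken from any paper listed here: the five-state chain environment, the `step`, `train`, and `greedy_policy` helpers, and all hyperparameter values are assumptions made for the example.

```python
import random

# All names and constants below are hypothetical, chosen for illustration.
N_STATES = 5          # states 0..4; reaching state 4 ends the episode with reward 1
ACTIONS = (-1, +1)    # move left or right along the chain
GAMMA = 0.9           # discount factor applied to future reward
ALPHA = 0.5           # learning rate
EPSILON = 0.2         # probability of taking a random exploratory action


def step(state, action):
    """Toy environment: return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, seed=0):
    """Learn action values q[state][action_index] by trial and error."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: mostly exploit current estimates, sometimes explore.
            if rng.random() < EPSILON:
                a = rng.randrange(2)
            else:
                best = max(q[state])
                a = rng.choice([i for i in range(2) if q[state][i] == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Q-learning target: immediate reward plus discounted best future value.
            target = reward + (0.0 if done else GAMMA * max(q[next_state]))
            q[state][a] += ALPHA * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each non-terminal state."""
    return [ACTIONS[max(range(2), key=lambda i: q[s][i])]
            for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this sketch the only reward sits at the right end of the chain, so the learned greedy policy should move right from every non-terminal state. The reward arrives only at the end of an episode, which is why the update propagates discounted value backward through earlier states.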

Papers

Showing 3401–3450 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research | Code | 3 |
| Deep Reinforcement Learning-based Exploration of Web Applications | Code | 0 |
| What Matters in Reinforcement Learning for Tractography | Code | 1 |
| Attention-based QoE-aware Digital Twin Empowered Edge Computing for Immersive Virtual Reality | | 0 |
| Task-Oriented Communication Design at Scale | | 0 |
| Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension | | 0 |
| Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs | | 0 |
| A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes | | 0 |
| Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback | | 0 |
| Multi-Agent Reinforcement Learning Resources Allocation Method Using Dueling Double Deep Q-Network in Vehicular Networks | Code | 0 |
| Towards Generalizable Reinforcement Learning for Trade Execution | | 0 |
| Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms | Code | 0 |
| On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm | | 0 |
| Optimizing Memory Mapping Using Deep Reinforcement Learning | | 0 |
| Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning | | 0 |
| Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas | | 0 |
| An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes | | 0 |
| Optimal Energy System Scheduling Using A Constraint-Aware Reinforcement Learning Algorithm | Code | 1 |
| Assessment of Reinforcement Learning Algorithms for Nuclear Power Plant Fuel Optimization | | 0 |
| Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection | | 0 |
| Policy Gradient Methods in the Presence of Symmetries and State Abstractions | Code | 1 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | Code | 1 |
| RLocator: Reinforcement Learning for Bug Localization | | 0 |
| Information Design in Multi-Agent Reinforcement Learning | Code | 1 |
| Knowledge-enhanced Agents for Interactive Text Games | | 0 |
| Reinforcement Learning for Topic Models | Code | 0 |
| Truncating Trajectories in Monte Carlo Reinforcement Learning | | 0 |
| Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization | | 0 |
| Explaining RL Decisions with Trajectories | Code | 0 |
| How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 2: Method and Applications | | 0 |
| Rethinking Population-assisted Off-policy Reinforcement Learning | | 0 |
| Federated Ensemble-Directed Offline Reinforcement Learning | Code | 1 |
| Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy | Code | 0 |
| How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory | | 0 |
| Simple Noisy Environment Augmentation for Reinforcement Learning | Code | 0 |
| Explainable Reinforcement Learning via a Causal World Model | Code | 1 |
| Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems | Code | 0 |
| Gym-preCICE: Reinforcement Learning Environments for Active Flow Control | | 0 |
| Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare | Code | 1 |
| Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees | Code | 0 |
| Validation of massively-parallel adaptive testing using dynamic control matching | | 0 |
| An Autonomous Non-monolithic Agent with Multi-mode Exploration based on Options Framework | Code | 0 |
| Online Portfolio Management via Deep Reinforcement Learning with High-Frequency Data | Code | 1 |
| A Transfer Learning Approach to Minimize Reinforcement Learning Risks in Energy Optimization for Smart Buildings | | 0 |
| Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning | | 0 |
| X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation | Code | 1 |
| A Federated Reinforcement Learning Framework for Link Activation in Multi-link Wi-Fi Networks | | 0 |
| One-Step Distributional Reinforcement Learning | | 0 |
| Multi-criteria Hardware Trojan Detection: A Reinforcement Learning Approach | | 0 |
| CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |