SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 54015450 of 15113 papers

TitleStatusHype
FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks0
Defending Observation Attacks in Deep Reinforcement Learning via Detection and DenoisingCode0
Deep Reinforcement Learning for Exact Combinatorial Optimization: Learning to Branch0
Transformers are Meta-Reinforcement LearnersCode1
RoSGAS: Adaptive Social Bot Detection with Reinforced Self-Supervised GNN Architecture SearchCode1
Universally Expressive Communication in Multi-Agent Reinforcement LearningCode0
Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT0
Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization0
Open-Ended Learning Strategies for Learning Complex Locomotion Skills0
Visual Radial Basis Q-Network0
Robust Reinforcement Learning with Distributional Risk-averse formulation0
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning0
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward0
Reinforcement Learning-based Placement of Charging Stations in Urban Road NetworksCode1
Provable Benefit of Multitask Representation Learning in Reinforcement Learning0
IGN : Implicit Generative NetworksCode0
Computation Offloading and Resource Allocation in F-RANs: A Federated Deep Reinforcement Learning Approach0
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks0
Intrinsically motivated option learning: a comparative study of recent methods0
Relative Policy-Transition Optimization for Fast Policy Transfer0
RL-GA: A Reinforcement Learning-Based Genetic Algorithm for Electromagnetic Detection Satellite Scheduling Problem0
Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning0
Case-Based Inverse Reinforcement Learning Using Temporal CoherenceCode0
Deep Reinforcement Learning for Optimal Investment and Saving Strategy Selection in Heterogeneous Profiles: Intelligent Agents working towards retirement0
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum GamesCode1
Federated Offline Reinforcement Learning0
An application of neural networks to a problem in knot theory and group theory (untangling braids)0
Large-Scale Retrieval for Reinforcement Learning0
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement LearningCode0
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?Code0
Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy0
ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement LearningCode1
Policy Gradient Reinforcement Learning for Uncertain Polytopic LPV Systems based on MHE-MPC0
Multifidelity Reinforcement Learning with Control Variates0
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS0
Regret Bounds for Information-Directed Reinforcement Learning0
Mildly Conservative Q-Learning for Offline Reinforcement LearningCode1
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement LearningCode1
Regret Analysis of Certainty Equivalence Policies in Continuous-Time Linear-Quadratic Systems0
Receding Horizon Inverse Reinforcement Learning0
There is no Accuracy-Interpretability Tradeoff in Reinforcement Learning for Mazes0
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-RiskCode1
Overcoming the Spectral Bias of Neural Value Approximation0
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information0
Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning0
Challenges and Opportunities in Offline Reinforcement Learning from Visual ObservationsCode2
Deep Surrogate Assisted Generation of Environments0
An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems0
A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement LearningCode1
Reinforced Inverse Scattering0
Show:102550
← PrevPage 109 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified