SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 52015250 of 15113 papers

TitleStatusHype
A Framework for History-Aware Hyperparameter Optimisation in Reinforcement Learning0
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks0
Computably Continuous Reinforcement-Learning Objectives are PAC-learnable0
Evolving Populations of Diverse RL Agents with MAP-Elites0
Conceptual Reinforcement Learning for Language-Conditioned Tasks0
RACCER: Towards Reachable and Certain Counterfactual Explanations for Reinforcement LearningCode0
Using Memory-Based Learning to Solve Tasks with State-Action Constraints0
MCTS-GEB: Monte Carlo Tree Search is a Good E-graph BuilderCode0
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples0
Graph Decision Transformer0
Learning When to Treat Business Processes: Prescriptive Process Monitoring with Causal Inference and Reinforcement LearningCode0
adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning0
Deep Occupancy-Predictive Representations for Autonomous Driving0
Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots0
Evolutionary Reinforcement Learning: A Survey0
Decoupling Skill Learning from Robotic Control for Generalizable Object Manipulation0
Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments0
Safe Reinforcement Learning via Probabilistic Logic ShieldsCode0
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning0
Perspectives on the Social Impacts of Reinforcement Learning with Human Feedback0
Dexterous In-hand Manipulation by Guiding Exploration with Simple Sub-skill Controllers0
Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment0
Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control TasksCode0
Sparsity-Aware Intelligent Massive Random Access Control in Open RAN: A Reinforcement Learning Based Approach0
Improved Sample Complexity Bounds for Distributionally Robust Reinforcement LearningCode0
Ensemble Reinforcement Learning: A Survey0
Local Environment Poisoning Attacks on Federated Reinforcement Learning0
Bounding the Optimal Value Function in Compositional Reinforcement LearningCode0
Look-Ahead AC Optimal Power Flow: A Model-Informed Reinforcement Learning Approach0
CFlowNets: Continuous Control with Generative Flow NetworksCode0
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games0
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control0
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning0
Tile Networks: Learning Optimal Geometric Layout for Whole-page Recommendation0
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning0
Guarded Policy Optimization with Imperfect Online Demonstrations0
Intelligent O-RAN Traffic Steering for URLLC Through Deep Reinforcement Learning0
Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning0
Approximating Energy Market Clearing and Bidding With Model-Based Reinforcement Learning0
Learning to Influence Human Behavior with Offline Reinforcement Learning0
Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications0
Domain Adaptation of Reinforcement Learning Agents based on Network Service Proximity0
Data-efficient, Explainable and Safe Box Manipulation: Illustrating the Advantages of Physical Priors in Model-Predictive Control0
Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning0
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement LearningCode0
Compensating for Sensing Failures via Delegation in Human-AI Hybrid Systems0
Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning0
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning0
Reinforcement Learning Guided Multi-Objective Exam Paper GenerationCode0
Show:102550
← PrevPage 105 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified