SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 40014050 of 15113 papers

TitleStatusHype
Accelerating the Computation of UCB and Related Indices for Reinforcement Learning0
Accelerating the Learning of TAMER with Counterfactual Explanations0
Accelerating Training in Pommerman with Imitation and Reinforcement Learning0
Acceleration of Actor-Critic Deep Reinforcement Learning for Visual Grasping in Clutter by State Representation Learning Based on Disentanglement of a Raw Input Image0
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning0
Accidental exploration through value predictors0
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning0
Accounting for the Sequential Nature of States to Learn Features for Reinforcement Learning0
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning0
ACDER: Augmented Curiosity-Driven Experience Replay0
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search0
ACECODER: Acing Coder RL via Automated Test-Case Synthesis0
A centralized reinforcement learning method for multi-agent job scheduling in Grid0
ACERAC: Efficient reinforcement learning in fine time discretization0
AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy0
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning0
ACES -- Automatic Configuration of Energy Harvesting Sensors with Reinforcement Learning0
Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning0
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning0
Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device0
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling0
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach0
A Closer Look at Reward Decomposition for High-Level Robotic Explanations0
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning0
A Coarse to Fine Question Answering System based on Reinforcement Learning0
A Cognitive Architecture Based on a Learning Classifier System with Spiking Classifiers0
A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition0
A note on stabilizing reinforcement learning0
A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning0
A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles0
A Comparative Analysis of Expected and Distributional Reinforcement Learning0
A Comparative Analysis of Machine Learning Techniques for IoT Intrusion Detection0
A Comparative Analysis of Reinforcement Learning and Conventional Deep Learning Approaches for Bearing Fault Diagnosis0
A comparative evaluation of machine learning methods for robot navigation through human crowds0
A Comparative Study of AI-based Intrusion Detection Techniques in Critical Infrastructures0
A Comparative Study of Deep Reinforcement Learning for Crop Production Management0
A Comparative Study of Reinforcement Learning Techniques on Dialogue Management0
A Comparison of Action Spaces for Learning Manipulation Tasks0
A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control0
A comparison of controller architectures and learning mechanisms for arbitrary robot morphologies0
A Comparison of learning algorithms on the Arcade Learning Environment0
A Comparison of Prediction Algorithms and Nexting for Short Term Weather Forecasts0
A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling0
A Comparison of Self-Play Algorithms Under a Generalized Framework0
A Complementary Learning Systems Approach to Temporal Difference Learning0
A comprehensive survey of research towards AI-enabled unmanned aerial systems in pre-, active-, and post-wildfire management0
A Computational Framework for Motor Skill Acquisition0
A Computational Model of Representation Learning in the Brain Cortex, Integrating Unsupervised and Reinforcement Learning0
A Conceptual Framework for Externally-influenced Agents: An Assisted Reinforcement Learning Review0
A Concise Introduction to Reinforcement Learning in Robotics0
Show:102550
← PrevPage 81 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified