SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1065110700 of 15113 papers

TitleStatusHype
Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem0
Reinforcement Learning based Design of Linear Fixed Structure Controllers0
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning0
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems0
ALLSTEPS: Curriculum-driven Learning of Stepping Stone SkillsCode1
Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python0
Synthesizing Safe Policies under Probabilistic Constraints with Reinforcement Learning and Bayesian Model Checking0
Is Deep Reinforcement Learning Ready for Practical Applications in Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in Sepsis Patients0
Learning hierarchical behavior and motion planning for autonomous drivingCode1
Reinforcement Learning with Feedback Graphs0
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document SummarizationCode1
Plan2Vec: Unsupervised Representation Learning by Latent PlansCode1
Curious Hierarchical Actor-Critic Reinforcement LearningCode1
CARL: Controllable Agent with Reinforcement Learning for Quadruped LocomotionCode1
Adaptive Dialog Policy Learning with Hindsight and User Modeling0
Safe Reinforcement Learning through Meta-learned Instincts0
Robotic Arm Control and Task Training through Deep Reinforcement Learning0
Gifting in multi-agent reinforcement learningCode0
Reinforcement Learning for UAV Autonomous Navigation, Mapping and Target Detection0
A Survey on Dialog Management: Recent Advances and Challenges0
Discrete-to-Deep Supervised Policy LearningCode0
Generalized Planning With Deep Reinforcement Learning0
Formal Policy Synthesis for Continuous-Space Systems via Reinforcement Learning0
Generalized Reinforcement Meta Learning for Few-Shot Optimization0
Hierarchical Decomposition of Nonlinear Dynamics and Control for System Identification and Policy Distillation0
Setting up experimental Bell test with reinforcement learning0
Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning0
Reward Constrained Interactive Recommendation with Natural Language Feedback0
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open ProblemsCode1
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning0
Off-Policy Adversarial Inverse Reinforcement LearningCode1
Multi-agent Reinforcement Learning for Decentralized Stable Matching0
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach0
Enhancing Text-based Reinforcement Learning Agents with Commonsense Knowledge0
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey0
Exploration in Reinforcement Learning with Deep Covering Options0
Learning Efficient Parameter Server Synchronization Policies for Distributed SGD0
Learning the Arrow of Time for Problems in Reinforcement Learning0
Learning Heuristics for Quantified Boolean Formulas through Reinforcement Learning0
Implementation Matters in Deep RL: A Case Study on PPO and TRPOCode1
Explain Your Move: Understanding Agent Actions Using Focused Feature SaliencyCode0
Deep Symbolic Superoptimization Without Human KnowledgeCode1
Option Discovery using Deep Skill ChainingCode1
Model Based Reinforcement Learning for Atari0
RaCT: Toward Amortized Ranking-Critical Training For Collaborative FilteringCode1
The Ingredients of Real World Robotic Reinforcement Learning0
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information0
Logic and the 2-Simplicial TransformerCode1
Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control0
Model-based reinforcement learning for biological sequence design0
Show:102550
← PrevPage 214 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified