SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1320113250 of 15113 papers

TitleStatusHype
Policy Certificates: Towards Accountable Reinforcement Learning0
Baselines for Reinforcement Learning in Text Games0
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search0
Deep Reinforcement Learning via L-BFGS Optimization0
Towards continual learning in medical imaging0
Adaptive Stress Testing: Finding Likely Failure Events with Reinforcement Learning0
Deep Reinforcement Learning for Green Security Games with Real-Time Information0
A Biologically Plausible Learning Rule for Deep Learning in the BrainCode0
Contingency-Aware Exploration in Reinforcement Learning0
Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder0
Managing engineering systems with large state and action spaces through deep reinforcement learning0
QUOTA: The Quantile Option Architecture for Reinforcement LearningCode0
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks0
Reinforcement Learning based Dynamic Model Selection for Short-Term Load Forecasting0
Bayesian Action Decoder for Deep Multi-Agent Reinforcement LearningCode1
Relation Mention Extraction from Noisy Data with Hierarchical Reinforcement Learning0
VIREL: A Variational Inference Framework for Reinforcement LearningCode0
Sequence Generation with Guider Network0
Automated Theorem Proving in Intuitionistic Propositional Logic by Deep Reinforcement Learning0
Dantzig Selector with an Approximately Optimal Denoising Matrix and its Application to Reinforcement Learning0
Shaping a social robot's humor with Natural Language Generation and socially-aware reinforcement learning0
Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based summarisation0
Joint Modeling for Query Expansion and Information Extraction with Reinforcement Learning0
Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning0
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform0
Temporal Regularization in Markov Decision ProcessCode0
SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning0
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning0
Relative Importance Sampling for off-Policy Actor-Critic in Deep Reinforcement Learning0
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous DrivingCode0
Exploration by Random Network DistillationCode1
Gated Hierarchical Attention for Image CaptioningCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
Assessing Generalization in Deep Reinforcement LearningCode0
Model-Based Active ExplorationCode1
Social Vehicle Swarms: A Novel Perspective on Social-aware Vehicular Communication Architecture0
DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable FeedbackCode0
Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach0
Learn to Steer through Deep Reinforcement LearningCode0
Multi-Agent Common Knowledge Reinforcement LearningCode0
Neural Modular Control for Embodied Question AnsweringCode0
Transfer of Deep Reactive Policies for MDP PlanningCode0
Stability-certified reinforcement learning: A control-theoretic perspective0
Empirical Evaluation of Contextual Policy Search with a Comparison-based Surrogate Model and Active Covariance Matrix Adaptation0
Differential Variable Speed Limits Control for Freeway Recurrent Bottlenecks via Deep Reinforcement learning0
Multi-Agent Reinforcement Learning Based Resource Allocation for UAV Networks0
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions0
Inverse reinforcement learning for video gamesCode0
Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich TasksCode1
Learning Representations in Model-Free Hierarchical Reinforcement Learning0
Show:102550
← PrevPage 265 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified