SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
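As a rough illustration of the loop described above, here is a minimal sketch in Python. Everything in it is an assumption made for the example and not taken from any listed paper: the `TwoArmedBandit` environment, its payout probabilities, the epsilon-greedy agent in `train`, and the `discounted_return` helper are all hypothetical names. The agent picks an action, receives a reward (+1) or punishment (-1), and updates a running estimate of each action's value.

```python
import random


def discounted_return(rewards, gamma=0.99):
    """Cumulative reward G = sum over t of gamma**t * r_t for one episode."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


class TwoArmedBandit:
    """Hypothetical toy environment: arm 1 pays +1 more often than arm 0."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.win_prob = (0.2, 0.8)  # assumed payout probabilities

    def step(self, action):
        # Reward +1 on a win, punishment -1 otherwise.
        return 1.0 if self.rng.random() < self.win_prob[action] else -1.0


def train(steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent that keeps a running mean reward per action."""
    env = TwoArmedBandit(seed)
    rng = random.Random(seed + 1)
    values = [0.0, 0.0]  # estimated value of each action
    counts = [0, 0]
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(2)  # explore
        else:
            action = 0 if values[0] >= values[1] else 1  # exploit
        reward = env.step(action)
        counts[action] += 1
        # Incremental mean update of the action-value estimate.
        values[action] += (reward - values[action]) / counts[action]
    return values


if __name__ == "__main__":
    print(discounted_return([1.0, 1.0, 1.0], gamma=0.5))
    print(train())
```

The learned policy here is simply "pick the action with the higher estimated value". Real RL problems add state and delayed rewards, which is where the discounted return becomes the quantity being maximized.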

Papers

Showing 13501-13550 of 15113 papers

Title | Status | Hype
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators | - | 0
Neural Math Word Problem Solver with Reinforcement Learning | - | 0
Structured Dialogue Policy with Graph Neural Networks | - | 0
Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks | Code | 0
Learning Dexterous In-Hand Manipulation | - | 0
Count-Based Exploration with the Successor Representation | Code | 0
Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration | - | 0
Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning | - | 0
Optimal Tap Setting of Voltage Regulation Transformers Using Batch Reinforcement Learning | - | 0
Multi-Agent Generative Adversarial Imitation Learning | Code | 1
Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning | - | 0
A Reinforcement Learning Approach to Target Tracking in a Camera Network | - | 0
Backprop-Q: Generalized Backpropagation for Stochastic Computation Graphs | Code | 0
Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches | Code | 0
Variational Bayesian Reinforcement Learning with Regret Bounds | - | 0
A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior | - | 0
Learning to Play Pong using Policy Gradient Learning | - | 0
Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences | - | 0
Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors | - | 0
Asynchronous Advantage Actor-Critic Agent for Starcraft II | - | 0
NAVREN-RL: Learning to fly in real environment via end-to-end deep reinforcement learning using monocular images | - | 0
Learning Heuristics for Quantified Boolean Formulas through Deep Reinforcement Learning | Code | 0
Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies | Code | 0
FuzzerGym: A Competitive Framework for Fuzzing and Learning | - | 0
Towards Explainable and Controllable Open Domain Dialogue Generation with Dialogue Acts | - | 0
Self-Organizing Maps as a Storage and Transfer Mechanism in Reinforcement Learning | - | 0
Representational efficiency outweighs action efficiency in human program induction | - | 0
News-based trading strategies | - | 0
Backplay: "Man muss immer umkehren" | Code | 0
Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game | Code | 0
Deep Reinforcement Learning for Swarm Systems | Code | 0
Foundations for Restraining Bolts: Reinforcement Learning with LTLf/LDLf restraining specifications | - | 0
Remember and Forget for Experience Replay | Code | 0
Safe Reinforcement Learning via Probabilistic Shields | - | 0
Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees | - | 0
Online Robust Policy Learning in the Presence of Unknown Adversaries | - | 0
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms | - | 0
Bipedal Walking Robot using Deep Deterministic Policy Gradient | Code | 0
Exploring Hierarchy-Aware Inverse Reinforcement Learning | - | 0
An Affective Robot Companion for Assisting the Elderly in a Cognitive Game Scenario | - | 0
Visual Reinforcement Learning with Imagined Goals | Code | 2
The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach | - | 0
Will it Blend? Composing Value Functions in Reinforcement Learning | - | 0
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees | Code | 0
CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving | - | 0
Is Q-learning Provably Efficient? | Code | 1
Partial Policy-based Reinforcement Learning for Anatomical Landmark Localization in 3D Medical Images | - | 0
Video Summarisation by Classification with Deep Reinforcement Learning | - | 0
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure | - | 0
Financial Trading as a Game: A Deep Reinforcement Learning Approach | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified