SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1360113650 of 15113 papers

TitleStatusHype
Interpretable Rationale Augmented Charge Prediction System0
Distantly Supervised NER with Partial Annotation Learning and Reinforcement LearningCode0
Learning Dexterous In-Hand Manipulation0
A New Concept of Deep Reinforcement Learning based Augmented General Tagging System0
Count-Based Exploration with the Successor RepresentationCode0
Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration0
Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning0
Optimal Tap Setting of Voltage Regulation Transformers Using Batch Reinforcement Learning0
Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning0
A Reinforcement Learning Approach to Target Tracking in a Camera Network0
Backprop-Q: Generalized Backpropagation for Stochastic Computation GraphsCode0
Multi-Agent Reinforcement Learning: A Report on Challenges and ApproachesCode0
Variational Bayesian Reinforcement Learning with Regret Bounds0
A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior0
Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences0
Learning to Play Pong using Policy Gradient Learning0
Asynchronous Advantage Actor-Critic Agent for Starcraft II0
Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors0
NAVREN-RL: Learning to fly in real environment via end-to-end deep reinforcement learning using monocular images0
Learning Heuristics for Quantified Boolean Formulas through Deep Reinforcement LearningCode0
FuzzerGym: A Competitive Framework for Fuzzing and Learning0
Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask DependenciesCode0
Towards Explainable and Controllable Open Domain Dialogue Generation with Dialogue Acts0
Self-Organizing Maps as a Storage and Transfer Mechanism in Reinforcement Learning0
News-based trading strategies0
Representational efficiency outweighs action efficiency in human program induction0
Backplay: "Man muss immer umkehren"Code0
Deep Reinforcement Learning for Swarm SystemsCode0
Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning GameCode0
Foundations for Restraining Bolts: Reinforcement Learning with LTLf/LDLf restraining specifications0
Safe Reinforcement Learning via Probabilistic Shields0
Remember and Forget for Experience ReplayCode0
Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees0
Online Robust Policy Learning in the Presence of Unknown Adversaries0
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms0
Bipedal Walking Robot using Deep Deterministic Policy GradientCode0
Exploring Hierarchy-Aware Inverse Reinforcement Learning0
An Affective Robot Companion for Assisting the Elderly in a Cognitive Game Scenario0
The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach0
Will it Blend? Composing Value Functions in Reinforcement Learning0
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical GuaranteesCode0
CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving0
Video Summarisation by Classification with Deep Reinforcement Learning0
Partial Policy-based Reinforcement Learning for Anatomical Landmark Localization in 3D Medical Images0
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure0
Financial Trading as a Game: A Deep Reinforcement Learning ApproachCode0
End-to-End Race Driving with Deep Reinforcement Learning0
Variance Reduction for Reinforcement Learning in Input-Driven Environments0
Deep Reinforcement Learning for Doom using Unsupervised Auxiliary Tasks0
Arcades: A deep model for adaptive decision making in voice controlled smart-home0
Show:102550
← PrevPage 273 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified