SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1350113550 of 15113 papers

TitleStatusHype
Multiobjective Reinforcement Learning for Reconfigurable Adaptive Optimal Control of Manufacturing Processes0
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning0
Object-sensitive Deep Reinforcement Learning0
Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement LearningCode0
Transparency and Explanation in Deep Reinforcement Learning Neural Networks0
Automata Guided Reinforcement Learning With Demonstrations0
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning0
Adversarial Imitation via Variational Inverse Reinforcement Learning0
Curriculum goal masking for continuous deep reinforcement learning0
Improvements on Hindsight Learning0
Deterministic Implementations for Reproducibility in Deep Reinforcement LearningCode0
Adversarial Reinforcement Learning for Observer Design in Autonomous Systems under Cyber Attacks0
Towards Better Interpretability in Deep Q-NetworksCode0
Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based multi-document summarisation0
Online Cyber-Attack Detection in Smart Grid: A Reinforcement Learning ApproachCode0
Visual Diagnostics for Deep Reinforcement Learning Policy Development0
Model-Based Reinforcement Learning via Meta-Policy Optimization0
Auto-tuning Distributed Stream Processing Systems using Reinforcement Learning0
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement LearningCode0
Image Captioning based on Deep Reinforcement Learning0
Improving Reinforcement Learning Based Image Captioning with Natural Language PriorCode0
Coordination-driven learning in multi-agent problem spaces0
Deep Reinforcement Learning for Event-Triggered ControlCode0
Combined Reinforcement Learning via Abstract RepresentationsCode0
Automatic, Personalized, and Flexible Playlist Generation using Reinforcement Learning0
Multi-task Deep Reinforcement Learning with PopArtCode0
Reinforcement Learning in Topology-based Representation for Human Body Movement with Whole Arm Manipulation0
ViZDoom Competitions: Playing Doom from PixelsCode0
VPE: Variational Policy Embedding for Transfer Reinforcement Learning0
Towards one-shot learning for rare-word translation with external experts0
Learning to Generate Structured Queries from Natural Language with Indirect Supervision0
Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement LearningCode0
Combining imagination and heuristics to learn strategies that generalizeCode0
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States0
A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising0
Probabilistic Prediction of Interactive Driving Behavior via Hierarchical Inverse Reinforcement Learning0
Learning Invariances for Policy GeneralizationCode0
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience ReplayCode0
How to Combine Tree-Search Methods in Reinforcement Learning0
Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a BenchmarkCode0
ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models0
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning0
Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks0
Reinforcement Learning under ThreatsCode0
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary PredictionCode0
Transferring Deep Reinforcement Learning with Adversarial Objective and Augmentation0
Recurrent World Models Facilitate Policy Evolution0
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning0
Effective Exploration for Deep Reinforcement Learning via Bootstrapped Q-Ensembles under Tsallis Entropy Regularization0
Visual Transfer between Atari Games using Competitive Reinforcement LearningCode0
Show:102550
← PrevPage 271 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified