SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
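As a rough illustration of the agent-environment loop described above, here is a minimal sketch using tabular Q-learning on a made-up five-state corridor. The environment, the names `step`, `train`, and `greedy_policy`, and all hyperparameter values are assumptions chosen for illustration; none of them come from the papers listed on this page.

```python
import random

N_STATES = 5          # hypothetical corridor: states 0..4, state 4 is the goal
ACTIONS = (-1, +1)    # index 0 = move left, index 1 = move right


def step(state, action):
    """Toy environment (an assumption): reward 1.0 only on reaching the goal."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))          # explore
            else:
                best = max(q[state])                     # exploit, random tie-break
                a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Bootstrapped target: immediate reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best action (as -1 or +1) for every non-terminal state."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: q[s][i])]
            for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the learned greedy policy should move right in every non-terminal state, since that is the only way to collect the reward. The discount factor `gamma` is what makes shorter paths to the goal more valuable than longer ones.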

Papers

Showing 13401-13450 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Curriculum goal masking for continuous deep reinforcement learning | | 0 |
| Object-sensitive Deep Reinforcement Learning | | 0 |
| Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning | Code | 0 |
| Transparency and Explanation in Deep Reinforcement Learning Neural Networks | | 0 |
| Improvements on Hindsight Learning | | 0 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Code | 0 |
| Adversarial Reinforcement Learning for Observer Design in Autonomous Systems under Cyber Attacks | | 0 |
| Towards Better Interpretability in Deep Q-Networks | Code | 0 |
| Visual Diagnostics for Deep Reinforcement Learning Policy Development | | 0 |
| Model-Based Reinforcement Learning via Meta-Policy Optimization | | 0 |
| Online Cyber-Attack Detection in Smart Grid: A Reinforcement Learning Approach | Code | 0 |
| Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based multi-document summarisation | | 0 |
| Auto-tuning Distributed Stream Processing Systems using Reinforcement Learning | | 0 |
| Improving Reinforcement Learning Based Image Captioning with Natural Language Prior | Code | 0 |
| CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning | Code | 0 |
| Deep Reinforcement Learning for Event-Triggered Control | Code | 0 |
| Coordination-driven learning in multi-agent problem spaces | | 0 |
| Image Captioning based on Deep Reinforcement Learning | | 0 |
| Negative Update Intervals in Deep Multi-Agent Reinforcement Learning | Code | 1 |
| Multi-task Deep Reinforcement Learning with PopArt | Code | 0 |
| Reinforcement Learning in Topology-based Representation for Human Body Movement with Whole Arm Manipulation | | 0 |
| Automatic, Personalized, and Flexible Playlist Generation using Reinforcement Learning | | 0 |
| Combined Reinforcement Learning via Abstract Representations | Code | 0 |
| SAI, a Sensible Artificial Intelligence that plays Go | Code | 1 |
| VPE: Variational Policy Embedding for Transfer Reinforcement Learning | | 0 |
| Towards one-shot learning for rare-word translation with external experts | | 0 |
| ViZDoom Competitions: Playing Doom from Pixels | Code | 0 |
| A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising | | 0 |
| Learning to Generate Structured Queries from Natural Language with Indirect Supervision | | 0 |
| Combining imagination and heuristics to learn strategies that generalize | Code | 0 |
| Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States | | 0 |
| Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning | Code | 0 |
| Probabilistic Prediction of Interactive Driving Behavior via Hierarchical Inverse Reinforcement Learning | | 0 |
| Learning Invariances for Policy Generalization | Code | 0 |
| ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models | | 0 |
| Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark | Code | 0 |
| ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay | Code | 0 |
| How to Combine Tree-Search Methods in Reinforcement Learning | | 0 |
| Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning | | 0 |
| Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks | | 0 |
| Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction | Code | 0 |
| Reinforcement Learning under Threats | Code | 0 |
| Recurrent World Models Facilitate Policy Evolution | | 0 |
| Transferring Deep Reinforcement Learning with Adversarial Objective and Augmentation | | 0 |
| Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning | | 0 |
| Visual Transfer between Atari Games using Competitive Reinforcement Learning | Code | 0 |
| Natural Language Person Search Using Deep Reinforcement Learning | | 0 |
| Effective Exploration for Deep Reinforcement Learning via Bootstrapped Q-Ensembles under Tsallis Entropy Regularization | | 0 |
| Collaborative Deep Reinforcement Learning for Multi-Object Tracking | | 0 |
| Goal-Oriented Visual Question Generation via Intermediate Rewards | | 0 |
Page 269 of 303

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |