SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1435114400 of 15113 papers

TitleStatusHype
A Machine Learning Approach to Routing0
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous ControlCode0
Decoupled Learning of Environment Characteristics for Safe Exploration0
Learning how to Active Learn: A Deep Reinforcement Learning ApproachCode0
Investigating Reinforcement Learning Agents for Continuous State Space Environments0
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-TuningCode0
Reinforced Video Captioning with Entailment Rewards0
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning0
Effective sketching methods for value function approximation0
The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task0
Reinforcement learning techniques for Outer Loop Link Adaptation in 4G/5G systems0
Variational Generative Stochastic Networks with Collaborative ShapingCode0
Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings0
Hierarchy Through Composition with Multitask LMDPs0
Grounding Language for Transfer in Deep Reinforcement LearningCode0
Using Reinforcement Learning to Model Incrementality in a Fast-Paced Dialogue Game0
Neural Optimizer Search using Reinforcement Learning0
World of Bits: An Open-Domain Platform for Web-Based Agents0
Plan, Attend, Generate: Character-Level Neural Machine Translation with Planning0
Spectrum Access In Cognitive Radio Using A Two Stage Reinforcement Learning Approach0
Advantages and Limitations of using Successor Features for Transfer in Reinforcement Learning0
Inverse Reinforcement Learning in Large State Spaces via Function Approximation0
Learning to Teach Reinforcement Learning Agents0
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse RewardsCode0
Learning Sparse Representations in Reinforcement Learning with Sparse Coding0
Guiding Reinforcement Learning Exploration Using Natural Language0
DARLA: Improving Zero-Shot Transfer in Reinforcement LearningCode0
Bellman Gradient Iteration for Inverse Reinforcement Learning0
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human FeedbackCode0
DeepPath: A Reinforcement Learning Method for Knowledge Graph ReasoningCode0
Imagination-Augmented Agents for Deep Reinforcement LearningCode0
Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning0
On-line Building Energy Optimization using Deep Reinforcement Learning0
Reverse Curriculum Generation for Reinforcement Learning0
Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning0
Trial without Error: Towards Safe Reinforcement Learning via Human InterventionCode0
Efficient Architecture Search by Network TransformationCode0
Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic0
Distral: Robust Multitask Reinforcement Learning0
Representation Learning for Grounded Spatial ReasoningCode0
Fastest Convergence for Q-learning0
Autoencoder-augmented Neuroevolution for Visual Doom Playing0
Value Prediction NetworkCode0
Q-Learning Algorithm for VoLTE Closed-Loop Power Control in Indoor Small Cells0
Deep Reinforcement Learning Attention Selection for Person Re-Identification0
Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement0
Learning human behaviors from motion capture by adversarial imitationCode0
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control0
The Complex Negotiation Dialogue Game0
Learning to Design Games: Strategic Environments in Reinforcement Learning0
Show:102550
← PrevPage 288 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified