SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1460114650 of 15113 papers

TitleStatusHype
Collaborative Deep Reinforcement LearningCode0
Collaborative Deep Reinforcement Learning for Joint Object Search0
Batch Policy Gradient Methods for Improving Neural Conversation Models0
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learningCode0
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning0
Autonomous Braking System via Deep Reinforcement LearningCode0
Semi-Supervised QA with Generative Domain-Adaptive Nets0
Uncertainty-Aware Reinforcement Learning for Collision Avoidance0
Deep Reinforcement Learning for Robotic Manipulation-The state of the art0
Deep Reinforcement Learning for Visual Object Tracking in Videos0
Flow Navigation by Smart Microswimmers via Reinforcement Learning0
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning0
Reinforcement Learning Algorithm Selection0
PathNet: Evolution Channels Gradient Descent in Super Neural NetworksCode0
Learning Light Transport the Reinforced WayCode0
Deep Reinforcement Learning: An OverviewCode0
Artificial Intelligence Approaches To UCAV Autonomy0
Adversarial Learning for Neural Dialogue GenerationCode0
Regularizing Neural Networks by Penalizing Confident Output DistributionsCode0
Binary Matrix Guessing Problem0
Basic protocols in quantum reinforcement learning with superconducting circuits0
Vulnerability of Deep Reinforcement Learning to Policy Induction AttacksCode0
Near Optimal Behavior via Approximate State AbstractionCode0
Agent-Agnostic Human-in-the-Loop Reinforcement Learning0
Scalable and Incremental Learning of Gaussian Mixture Models0
Real-Time Bidding by Reinforcement Learning in Display AdvertisingCode0
Reinforcement Learning based Embodied Agents Modelling Human Users Through Interaction and Multi-Sensory Perception0
Reinforcement Learning via Recurrent Convolutional Neural NetworksCode0
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making0
A Review of Neural Network Based Machine Learning Approaches for Rotor Angle Stability Control0
First-Person Activity Forecasting with Online Inverse Reinforcement Learning0
On the function approximation error for risk-sensitive reinforcement learning0
Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning0
A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to ImitationCode0
Loss is its own Reward: Self-Supervision for Reinforcement Learning0
Unsupervised Perceptual Rewards for Imitation Learning0
Self-Correcting Models for Model-Based Reinforcement LearningCode0
Sample-efficient Deep Reinforcement Learning for Dialog Control0
Reinforcement Learning Using Quantum Boltzmann Machines0
A User Simulator for Task-Completion DialoguesCode0
Learning to predict where to look in interactive environments using deep recurrent q-learning0
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments0
Separation of Concerns in Reinforcement Learning0
Response to Comment on 'Perceptual Learning Incepted by Decoded fMRI Neurofeedback Without Stimulus Presentation'; How can a decoded neurofeedback method (DecNef) lead to successful reinforcement and visual perceptual learning?0
Incorporating Human Domain Knowledge into Large Scale Cost Function Learning0
End-to-End Deep Reinforcement Learning for Lane Keeping Assist0
Learning to Drive using Inverse Reinforcement Learning and Deep Q-Networks0
Online Reinforcement Learning for Real-Time Exploration in Continuous State and Action Markov Decision Processes0
PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning0
Reinforcement Learning With Temporal Logic Rewards0
Show:102550
← PrevPage 293 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified