SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
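To make the agent, environment, and reward loop concrete, here is a minimal sketch using tabular Q-learning, one common RL algorithm chosen purely for illustration. The five-state chain environment, the function names (`step`, `train`, `greedy_policy`), and all hyperparameter values are assumptions made for this example and do not come from any paper listed on this page.

```python
import random

N_STATES = 5  # hypothetical chain: states 0..4, state 4 is the terminal goal


def step(state, action):
    """Toy environment (an assumption): action 1 moves right, action 0 moves left.

    Reaching the last state yields reward 1.0 and ends the episode.
    """
    if action == 1:
        next_state = min(state + 1, N_STATES - 1)
    else:
        next_state = max(state - 1, 0)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, max_steps=1000, seed=0):
    """Learn a Q-table with epsilon-greedy exploration and the Q-learning update."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):  # cap episode length as a safety guard
            left, right = q[state]
            if rng.random() < epsilon or left == right:
                action = rng.randrange(2)  # explore, or break ties randomly
            else:
                action = 1 if right > left else 0  # exploit current estimate
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Return the greedy action for each non-terminal state."""
    return [1 if right > left else 0 for left, right in q[:-1]]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the learned greedy policy should prefer moving right in every non-terminal state, since that is the shortest route to the only rewarding state. The discount factor `gamma` is what turns a single delayed reward into a preference for shorter paths.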

Papers

Showing 14701–14750 of 15113 papers

Title | Status | Hype
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable |  | 0
Quantum-enhanced machine learning |  | 0
Reinforcement Learning in Conflicting Environments for Autonomous Vehicles |  | 0
Utilization of Deep Reinforcement Learning for saccadic-based object visual search |  | 0
Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies |  | 0
A Reinforcement Learning Approach to the View Planning Problem |  | 0
Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data |  | 0
The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits |  | 0
Sim-to-Real Robot Learning from Pixels with Progressive Nets |  | 0
Reset-free Trial-and-Error Learning for Robot Damage Recovery | Code | 0
Introduction to the "Industrial Benchmark" |  | 0
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving |  | 0
Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation |  | 0
Personalizing a Dialogue System with Transfer Reinforcement Learning |  | 0
Multi-Objective Deep Reinforcement Learning | Code | 0
Deep Reinforcement Learning From Raw Pixels in Doom |  | 0
Active exploration in parameterized reinforcement learning | Code | 0
Connecting Generative Adversarial Networks and Actor-Critic Methods |  | 0
Towards Cognitive Exploration through Deep Reinforcement Learning for Mobile Robots |  | 0
Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States |  | 0
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates |  | 0
Deep Visual Foresight for Planning Robot Motion | Code | 0
Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search |  | 0
Deep Reinforcement Learning for Tensegrity Robot Locomotion |  | 0
Deep Reinforcement Learning for Mention-Ranking Coreference Models | Code | 0
UbuntuWorld 1.0 LTS - A Platform for Automated Problem Solving & Troubleshooting in the Ubuntu OS |  | 0
Regulating Reward Training by Means of Certainty Prediction in a Neural Network-Implemented Pong Game |  | 0
Input Convex Neural Networks | Code | 0
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer |  | 0
Modelling Stock-market Investors as Reinforcement Learning Agents [Correction] |  | 0
Towards Deep Symbolic Reinforcement Learning |  | 0
Opponent Modeling in Deep Reinforcement Learning | Code | 0
Playing FPS Games with Deep Reinforcement Learning | Code | 0
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient | Code | 0
Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning | Code | 0
The Option-Critic Architecture | Code | 0
Exploration Potential |  | 0
Interactive Spoken Content Retrieval by Deep Reinforcement Learning |  | 0
Bayesian Reinforcement Learning: A Survey |  | 0
Stochastic evolution in populations of ideas |  | 0
A Threshold-based Scheme for Reinforcement Learning in Neural Networks | Code | 0
A centralized reinforcement learning method for multi-agent job scheduling in Grid |  | 0
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks |  | 0
Dialogue manager domain adaptation using Gaussian process reinforcement learning |  | 0
Unifying task specification in reinforcement learning |  | 0
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access | Code | 0
Reward Function and Initial Values: Better Choices for Accelerated Goal-Directed Reinforcement Learning |  | 0
Single photon in hierarchical architecture for physical reinforcement learning: Photon intelligence |  | 0
Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference |  | 0
Modeling Human Reading with Neural Attention |  | 0
Page 295 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 |  | Unverified
2 | PPO | Mean Normalized Performance | 0.58 |  | Unverified