MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence Dec 2, 2017 GPU Multi-agent Reinforcement Learning
Code Code Available 0Online Reinforcement Learning in Stochastic Games Dec 2, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Progressive Neural Architecture Search Dec 2, 2017 Evolutionary Algorithms General Classification
Code Code Available 0Natural Value Approximators: Learning when to Trust Past Estimates Dec 1, 2017 Atari Games Inductive Bias
— Unverified 0Log-normality and Skewness of Estimated State/Action Values in Reinforcement Learning Dec 1, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Optimistic posterior sampling for reinforcement learning: worst-case regret bounds Dec 1, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes Dec 1, 2017 Decision Making Deep Reinforcement Learning
— Unverified 0Adaptive Batch Size for Safe Policy Gradients Dec 1, 2017 Policy Gradient Methods Reinforcement Learning
— Unverified 0Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs Dec 1, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Dynamic-Depth Context Tree Weighting Dec 1, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Compatible Reward Inverse Reinforcement Learning Dec 1, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Time Limits in Reinforcement Learning Dec 1, 2017 General Reinforcement Learning reinforcement-learning
Code Code Available 1Safe Exploration for Identifying Linear Systems via Robust Optimization Nov 30, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Transferring Autonomous Driving Knowledge on Simulated and Real Intersections Nov 30, 2017 Autonomous Driving Autonomous Vehicles
— Unverified 0Embodied Question Answering Nov 30, 2017 Embodied Question Answering Navigate
Code Code Available 0Improved Learning in Evolution Strategies via Sparser Inter-Agent Network Topologies Nov 30, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control Nov 30, 2017 continuous-control Continuous Control
— Unverified 0Can Complex Collective Behaviour Be Generated Through Randomness, Memory and a Pinch of Luck? Nov 29, 2017 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0HoME: a Household Multimodal Environment Nov 29, 2017 OpenAI Gym reinforcement-learning
— Unverified 0End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning Nov 29, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing Nov 29, 2017 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Reinforcement Learning To Adapt Speech Enhancement to Instantaneous Input Signal Quality Nov 29, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Video Captioning via Hierarchical Reinforcement Learning Nov 29, 2017 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management Nov 29, 2017 Benchmarking Deep Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for De-Novo Drug Design Nov 29, 2017 Deep Reinforcement Learning Drug Design
Code Code Available 0Hierarchical Policy Search via Return-Weighted Density Estimation Nov 28, 2017 Density Estimation Motion Planning
— Unverified 0One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay Nov 28, 2017 Navigate reinforcement-learning
Code Code Available 1Plan, Attend, Generate: Planning for Sequence-to-Sequence Models Nov 28, 2017 Question Generation Question-Generation
Code Code Available 1Learning from Longitudinal Face Demonstration - Where Tractable Deep Modeling Meets Inverse Reinforcement Learning Nov 28, 2017 Face Verification MORPH
— Unverified 0Risk-sensitive Inverse Reinforcement Learning via Semi- and Non-Parametric Methods Nov 28, 2017 Decision Making reinforcement-learning
Code Code Available 0A reinforcement learning algorithm for building collaboration in multi-agent systems Nov 28, 2017 Q-Learning reinforcement-learning
— Unverified 0Crossmodal Attentive Skill Learner Nov 28, 2017 Atari Games CPU
Code Code Available 0Deep Reinforcement Learning for Sepsis Treatment Nov 27, 2017 Decision Making Deep Reinforcement Learning
Code Code Available 0AI Safety Gridworlds Nov 27, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Divide-and-Conquer Reinforcement Learning Nov 27, 2017 Deep Reinforcement Learning Policy Gradient Methods
Code Code Available 0Generative Adversarial Network for Abstractive Text Summarization Nov 26, 2017 Abstractive Text Summarization Generative Adversarial Network
Code Code Available 0Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning Nov 25, 2017 Deep Reinforcement Learning Holdout Set
— Unverified 0Ethical Challenges in Data-Driven Dialogue Systems Nov 24, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Cascade Attribute Learning Network Nov 24, 2017 Attribute Position
— Unverified 0Action Branching Architectures for Deep Reinforcement Learning Nov 24, 2017 continuous-control Continuous Control
Code Code Available 1Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards Nov 21, 2017 Deep Reinforcement Learning Informativeness
— Unverified 0Transferring Agent Behaviors from Videos via Motion GANs Nov 21, 2017 General Reinforcement Learning Generative Adversarial Network
— Unverified 0Posterior Sampling for Large Scale Reinforcement Learning Nov 21, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Teaching a Machine to Read Maps with Deep Reinforcement Learning Nov 20, 2017 Deep Reinforcement Learning Navigate
Code Code Available 0Classification with Costly Features using Deep Reinforcement Learning Nov 20, 2017 Classification Classification with Costly Features
Code Code Available 0Deep Reinforcement Learning for Multi-Resource Multi-Machine Job Scheduling Nov 20, 2017 CPU Deep Reinforcement Learning
— Unverified 0Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning Nov 18, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Run, skeleton, run: skeletal model in a physics-based simulation Nov 18, 2017 Navigate Policy Gradient Methods
Code Code Available 0Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction Nov 18, 2017 parameter estimation Q-Learning
— Unverified 0Hindsight policy gradients Nov 16, 2017 Policy Gradient Methods reinforcement-learning
Code Code Available 0