Avoiding Catastrophic States with Intrinsic Fear Jan 1, 2018 Atari Games Deep Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for List-wise Recommendations Dec 30, 2017 Deep Reinforcement Learning Recommendation Systems
Code Code Available 1Learning Structural Weight Uncertainty for Sequential Decision-Making Dec 30, 2017 Decision Making Multi-Armed Bandits
Code Code Available 0Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward Dec 29, 2017 Decision Making Deep Reinforcement Learning
Code Code Available 0SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation Dec 29, 2017 Q-Learning reinforcement-learning
— Unverified 0Reinforcement Learning with Analogical Similarity to Guide Schema Induction and Attention Dec 28, 2017 Analogical Similarity reinforcement-learning
— Unverified 0Multi-timescale memory dynamics in a reinforcement learning network with attention-gated memory Dec 28, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Consensus-based Sequence Training for Video Captioning Dec 27, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0RLlib: Abstractions for Distributed Reinforcement Learning Dec 26, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 4Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger Dec 23, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1A short variational proof of equivalence between policy gradients and soft Q learning Dec 22, 2017 Q-Learning reinforcement-learning
— Unverified 0Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning Dec 22, 2017 Deep Reinforcement Learning Efficient Exploration
Code Code Available 0Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator Dec 22, 2017 continuous-control Continuous Control
— Unverified 0Multiagent-based Participatory Urban Simulation through Inverse Reinforcement Learning Dec 21, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning Dec 20, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition Dec 20, 2017 Multi-Label Image Recognition reinforcement-learning
— Unverified 0Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning Dec 20, 2017 Minecraft reinforcement-learning
— Unverified 0Pseudorehearsal in actor-critic agents with neural network function approximation Dec 20, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Two-dimensional Anti-jamming Mobile Communication Based on Reinforcement Learning Dec 19, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0On Wasserstein Reinforcement Learning and the Fokker-Planck equation Dec 19, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0On the Relationship Between the OpenAI Evolution Strategy and Stochastic Gradient Descent Dec 18, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning Dec 18, 2017 Deep Reinforcement Learning Evolutionary Algorithms
Code Code Available 0ES Is More Than Just a Traditional Finite-Difference Approximator Dec 18, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents Dec 18, 2017 Deep Reinforcement Learning Policy Gradient Methods
Code Code Available 0Integral Equations and Machine Learning Dec 17, 2017 BIG-bench Machine Learning Image Generation
— Unverified 0Towards a Deep Reinforcement Learning Approach for Tower Line Wars Dec 17, 2017 Deep Reinforcement Learning Q-Learning
— Unverified 0Ray: A Distributed Framework for Emerging AI Applications Dec 16, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 4Occam's razor is insufficient to infer the preferences of irrational agents Dec 15, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Hierarchical Text Generation and Planning for Strategic Dialogue Dec 15, 2017 Decision Making reinforcement-learning
Code Code Available 0AI2-THOR: An Interactive 3D Environment for Visual AI Dec 14, 2017 Deep Reinforcement Learning Imitation Learning
Code Code Available 1Differentiable lower bound for expected BLEU score Dec 13, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Inverse Reinforcement Learning for Marketing Dec 13, 2017 Marketing reinforcement-learning
— Unverified 0Multi-focus Attention Network for Efficient Deep Reinforcement Learning Dec 13, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds Dec 13, 2017 Benchmarking Model-based Reinforcement Learning
Code Code Available 0Deep Reinforcement Learning Boosted by External Knowledge Dec 12, 2017 Atari Games Deep Reinforcement Learning
— Unverified 0A Low-Cost Ethics Shaping Approach for Designing Reinforcement Learning Agents Dec 12, 2017 Ethics reinforcement-learning
Code Code Available 0Interpretable Policies for Reinforcement Learning by Genetic Programming Dec 12, 2017 regression reinforcement-learning
— Unverified 0Simulated Autonomous Driving on Realistic Road Networks using Deep Reinforcement Learning Dec 12, 2017 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Robust Deep Reinforcement Learning with Adversarial Attacks Dec 11, 2017 Deep Reinforcement Learning Q-Learning
— Unverified 0MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments Dec 11, 2017 Deep Reinforcement Learning Navigate
Code Code Available 0The Eigenoption-Critic Framework Dec 11, 2017 Efficient Exploration Hierarchical Reinforcement Learning
— Unverified 0Reinforced dynamics for enhanced sampling in large atomic and molecular systems Dec 10, 2017 Deep Reinforcement Learning Efficient Exploration
— Unverified 0Stochastic Answer Networks for Machine Reading Comprehension Dec 10, 2017 Machine Reading Comprehension Question Answering
Code Code Available 0Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality Dec 7, 2017 Q-Learning reinforcement-learning
— Unverified 0End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient Dec 7, 2017 Decoder Goal-Oriented Dialog
— Unverified 0Noisy Natural Gradient as Variational Inference Dec 6, 2017 Active Learning Efficient Exploration
Code Code Available 0Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Dec 5, 2017 Game of Chess Game of Go
Code Code Available 1A Deeper Look at Experience Replay Dec 4, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 0Interactive Reinforcement Learning for Object Grounding via Self-Talking Dec 2, 2017 Object reinforcement-learning
— Unverified 0Representation and Reinforcement Learning for Personalized Glycemic Control in Septic Patients Dec 2, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0