Paper | Date | Tasks | Code
Combination of Supervised and Reinforcement Learning For Vision-Based Autonomous Control | Jan 1, 2018 | MuJoCo, reinforcement-learning | Unverified
Learning Dynamic State Abstractions for Model-Based Reinforcement Learning | Jan 1, 2018 | Atari Games, Decision Making | Unverified
Learning an Embedding Space for Transferable Robot Skills | Jan 1, 2018 | reinforcement-learning, Reinforcement Learning | Unverified
Learning to Treat Sepsis with Multi-Output Gaussian Process Deep Recurrent Q-Networks | Jan 1, 2018 | Deep Reinforcement Learning, Gaussian Processes | Unverified
Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous Vehicles, Decision Making | Unverified
Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | Unverified
Representing Entropy: A short proof of the equivalence between soft Q-learning and policy gradients | Jan 1, 2018 | Q-Learning, reinforcement-learning | Unverified
NerveNet: Learning Structured Policy with Graph Neural Networks | Jan 1, 2018 | Benchmarking, continuous-control | Code Available
Reinforcement Learning via Replica Stacking of Quantum Measurements for the Training of Quantum Boltzmann Machines | Jan 1, 2018 | reinforcement-learning, Reinforcement Learning | Unverified
Policy Gradient For Multidimensional Action Spaces: Action Sampling and Entropy Bonus | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | Unverified
Neural Task Graph Execution | Jan 1, 2018 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified
Universal Agent for Disentangling Environments and Tasks | Jan 1, 2018 | Hierarchical Reinforcement Learning, reinforcement-learning | Unverified
Model-based imitation learning from state trajectories | Jan 1, 2018 | Imitation Learning, model | Unverified
Predicting Multiple Actions for Stochastic Continuous Control | Jan 1, 2018 | continuous-control, Continuous Control | Unverified
Neuron as an Agent | Jan 1, 2018 | counterfactual, Multi-agent Reinforcement Learning | Unverified
Using Deep Reinforcement Learning to Generate Rationales for Molecules | Jan 1, 2018 | Deep Reinforcement Learning, Drug Design | Unverified
Residual Loss Prediction: Reinforcement Learning With No Incremental Feedback | Jan 1, 2018 | Multi-Armed Bandits, Prediction | Code Available
LSD-Net: Look, Step and Detect for Joint Navigation and Multi-View Recognition with Deep Reinforcement Learning | Jan 1, 2018 | Deep Reinforcement Learning, General Classification | Unverified
Reward Estimation via State Prediction | Jan 1, 2018 | Prediction, reinforcement-learning | Unverified
Now I Remember! Episodic Memory For Reinforcement Learning | Jan 1, 2018 | reinforcement-learning, Reinforcement Learning | Unverified
Learning Structural Weight Uncertainty for Sequential Decision-Making | Dec 30, 2017 | Decision Making, Multi-Armed Bandits | Code Available
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision Making, Deep Reinforcement Learning | Code Available
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation | Dec 29, 2017 | Q-Learning, reinforcement-learning | Unverified
Multi-timescale memory dynamics in a reinforcement learning network with attention-gated memory | Dec 28, 2017 | reinforcement-learning, Reinforcement Learning | Code Available
Reinforcement Learning with Analogical Similarity to Guide Schema Induction and Attention | Dec 28, 2017 | Analogical Similarity, reinforcement-learning | Unverified
Consensus-based Sequence Training for Video Captioning | Dec 27, 2017 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified
A short variational proof of equivalence between policy gradients and soft Q learning | Dec 22, 2017 | Q-Learning, reinforcement-learning | Unverified
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator | Dec 22, 2017 | continuous-control, Continuous Control | Unverified
Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Dec 22, 2017 | Deep Reinforcement Learning, Efficient Exploration | Code Available
Multiagent-based Participatory Urban Simulation through Inverse Reinforcement Learning | Dec 21, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition | Dec 20, 2017 | Multi-Label Image Recognition, reinforcement-learning | Unverified
Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning | Dec 20, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Pseudorehearsal in actor-critic agents with neural network function approximation | Dec 20, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning | Dec 20, 2017 | Minecraft, reinforcement-learning | Unverified
Two-dimensional Anti-jamming Mobile Communication Based on Reinforcement Learning | Dec 19, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
On Wasserstein Reinforcement Learning and the Fokker-Planck equation | Dec 19, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
On the Relationship Between the OpenAI Evolution Strategy and Stochastic Gradient Descent | Dec 18, 2017 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified
ES Is More Than Just a Traditional Finite-Difference Approximator | Dec 18, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement Learning, Evolutionary Algorithms | Code Available
Integral Equations and Machine Learning | Dec 17, 2017 | BIG-bench Machine Learning, Image Generation | Unverified
Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified
Occam's razor is insufficient to infer the preferences of irrational agents | Dec 15, 2017 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified
Hierarchical Text Generation and Planning for Strategic Dialogue | Dec 15, 2017 | Decision Making, reinforcement-learning | Code Available
Differentiable lower bound for expected BLEU score | Dec 13, 2017 | reinforcement-learning, Reinforcement Learning | Code Available
Inverse Reinforcement Learning for Marketing | Dec 13, 2017 | Marketing, reinforcement-learning | Unverified
QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds | Dec 13, 2017 | Benchmarking, Model-based Reinforcement Learning | Code Available
Multi-focus Attention Network for Efficient Deep Reinforcement Learning | Dec 13, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Simulated Autonomous Driving on Realistic Road Networks using Deep Reinforcement Learning | Dec 12, 2017 | Autonomous Driving, Deep Reinforcement Learning | Unverified
Deep Reinforcement Learning Boosted by External Knowledge | Dec 12, 2017 | Atari Games, Deep Reinforcement Learning | Unverified