Markov Decision Processes with Continuous Side Information Nov 15, 2017 PAC learning Reinforcement Learning
— Unverified 0Variational Adaptive-Newton Method for Explorative Learning Nov 15, 2017 Active Learning reinforcement-learning
— Unverified 0BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Nov 15, 2017 Deep Reinforcement Learning Efficient Exploration
— Unverified 0Finding Efficient Swimming Strategies in a Three Dimensional Chaotic Flow by Reinforcement Learning Nov 15, 2017 Navigate reinforcement-learning
— Unverified 0Costate-focused models for reinforcement learning Nov 15, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Saliency-based Sequential Image Attention with Multiset Prediction Nov 14, 2017 Classification General Classification
— Unverified 0Loss Functions for Multiset Prediction Nov 14, 2017 Decision Making Prediction
— Unverified 0Reinforcement Learning in a large scale photonic Recurrent Neural Network Nov 14, 2017 BIG-bench Machine Learning reinforcement-learning
— Unverified 0Classical Structured Prediction Losses for Sequence to Sequence Learning Nov 14, 2017 Abstractive Text Summarization Machine Translation
— Unverified 0SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning Nov 13, 2017 Decoder reinforcement-learning
Code Code Available 2Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection Nov 10, 2017 General Reinforcement Learning reinforcement-learning
— Unverified 0Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation Nov 10, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Accelerated Method for Stochastic Composition Optimization with Nonsmooth Regularization Nov 10, 2017 Management reinforcement-learning
— Unverified 0Applications of Deep Learning and Reinforcement Learning to Biological Data Nov 10, 2017 Deep Learning reinforcement-learning
— Unverified 0An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks Nov 9, 2017 Descriptive Reading Comprehension
— Unverified 0Worm-level Control through Search-based Reinforcement Learning Nov 9, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0LatentPoison - Adversarial Attacks On The Latent Space Nov 8, 2017 Decoder General Classification
Code Code Available 0Energy Storage Arbitrage in Real-Time Markets via Reinforcement Learning Nov 8, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? Nov 7, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms Nov 5, 2017 Q-Learning reinforcement-learning
— Unverified 0Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning Nov 4, 2017 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Policy Optimization by Genetic Distillation Nov 3, 2017 Deep Reinforcement Learning Imitation Learning
— Unverified 0Adaptive coordination of working-memory and reinforcement learning in non-human primates performing a trial-and-error problem solving task Nov 2, 2017 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning Nov 2, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Automatic Text Summarization Using Reinforcement Learning with Embedding Features Nov 1, 2017 ARC Information Retrieval
— Unverified 0Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning Nov 1, 2017 Decision Making Deep Reinforcement Learning
— Unverified 0Intelligent Parameter Tuning in Optimization-based Iterative CT Reconstruction via Deep Reinforcement Learning Nov 1, 2017 CT Reconstruction Deep Reinforcement Learning
— Unverified 0Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning Nov 1, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Paraphrase Generation with Deep Reinforcement Learning Nov 1, 2017 Deep Reinforcement Learning Paraphrase Generation
— Unverified 0Regret Minimization for Partially Observable Deep Reinforcement Learning Oct 31, 2017 counterfactual Deep Reinforcement Learning
Code Code Available 0Backpropagation through the Void: Optimizing control variates for black-box gradient estimation Oct 31, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning Oct 31, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 0Visualizing and Understanding Atari Agents Oct 31, 2017 Deep Reinforcement Learning Reinforcement Learning
Code Code Available 0Automata-Guided Hierarchical Reinforcement Learning for Skill Composition Oct 31, 2017 Deep Reinforcement Learning Hierarchical Reinforcement Learning
— Unverified 0Learning Robust Rewards with Adversarial Inverse Reinforcement Learning Oct 30, 2017 Decision Making Deep Reinforcement Learning
Code Code Available 1Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming Oct 30, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Exponential improvements for quantum-accessible reinforcement learning Oct 30, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach Oct 30, 2017 Deep Reinforcement Learning Position
Code Code Available 0Action-depedent Control Variates for Policy Optimization via Stein's Identity Oct 30, 2017 Policy Gradient Methods reinforcement-learning
Code Code Available 0Eigenoption Discovery through the Deep Successor Representation Oct 30, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 1Artificial Intelligence as Structural Estimation: Economic Interpretations of Deep Blue, Bonanza, and AlphaGo Oct 30, 2017 Econometrics Reinforcement Learning
— Unverified 0Sequence-to-Sequence ASR Optimization via Reinforcement Learning Oct 30, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning Oct 28, 2017 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Distributional Reinforcement Learning with Quantile Regression Oct 27, 2017 Atari Games Distributional Reinforcement Learning
Code Code Available 0Inverse Reinforcement Learning Under Noisy Observations Oct 27, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning Oct 27, 2017 Atari Games Multi-Task Learning
Code Code Available 0Learning Approximate Stochastic Transition Models Oct 26, 2017 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0Accelerated Reinforcement Learning Oct 23, 2017 Policy Gradient Methods reinforcement-learning
— Unverified 0Exploiting generalization in the subspaces for faster model-based learning Oct 22, 2017 Decision Making Reinforcement Learning
— Unverified 0Insulin Regimen ML-based control for T2DM patients Oct 21, 2017 Model-based Reinforcement Learning Reinforcement Learning
— Unverified 0