Don't Forget Your Teacher: A Corrective Reinforcement Learning Framework May 30, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Effective Medical Test Suggestions Using Deep Reinforcement Learning May 30, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Combating the Compounding-Error Problem with a Multi-step Model May 30, 2019 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Advantage Amplification in Slowly Evolving Latent-State Environments May 29, 2019 Recommendation Systems reinforcement-learning
— Unverified 0An Improved Convergence Analysis of Stochastic Variance-Reduced Policy Gradient May 29, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Linear interpolation gives better gradients than Gaussian smoothing in derivative-free optimization May 29, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning with Policy Mixture Model for Temporal Point Processes Clustering May 29, 2019 Clustering Point Processes
— Unverified 0CopyCAT: Taking Control of Neural Policies with Constant Attacks May 29, 2019 Atari Games Deep Reinforcement Learning
— Unverified 0On the Generalization Gap in Reparameterizable Reinforcement Learning May 29, 2019 Learning Theory reinforcement-learning
— Unverified 0Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology May 29, 2019 Q-Learning Recommendation Systems
— Unverified 0Switching Linear Dynamics for Variational Bayes Filtering May 29, 2019 Bayesian Inference Model-based Reinforcement Learning
— Unverified 0Variance Reduction for Evolution Strategies via Structured Control Variates May 29, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A General Markov Decision Process Framework for Directly Learning Optimal Control Policies May 28, 2019 Q-Learning Reinforcement Learning
— Unverified 0Conditions on Features for Temporal Difference-Like Methods to Converge May 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Beyond Exponentially Discounted Sum: Automatic Learning of Return Function May 28, 2019 Atari Games Form
— Unverified 0Generation of Policy-Level Explanations for Reinforcement Learning May 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Interactive Teaching Algorithms for Inverse Reinforcement Learning May 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Learning robust control for LQR systems with multiplicative noise via policy gradient May 28, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning May 27, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning May 27, 2019 Q-Learning reinforcement-learning
Code Code Available 0Explainable Reinforcement Learning Through a Causal Lens May 27, 2019 counterfactual reinforcement-learning
Code Code Available 0Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction May 27, 2019 continuous-control Continuous Control
— Unverified 0AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning May 27, 2019 Deep Reinforcement Learning Dialogue Management
— Unverified 0Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning May 27, 2019 Decision Making Deep Reinforcement Learning
Code Code Available 0Policy Search by Target Distribution Learning for Continuous Control May 27, 2019 continuous-control Continuous Control
— Unverified 0Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies May 27, 2019 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0Near-optimal Optimistic Reinforcement Learning using Empirical Bernstein Inequalities May 27, 2019 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Selective Transfer with Reinforced Transfer Network for Partial Domain Adaptation May 26, 2019 Domain Adaptation Partial Domain Adaptation
— Unverified 0Variational Bayes: A report on approaches and applications May 26, 2019 Bayesian Inference Continual Learning
— Unverified 0Safe Reinforcement Learning with Nonlinear Dynamics via Model Predictive Shielding May 25, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Composing Task-Agnostic Policies with Deep Reinforcement Learning May 25, 2019 Decision Making Deep Reinforcement Learning
— Unverified 0Prioritized Sequence Experience Replay May 25, 2019 Deep Reinforcement Learning Q-Learning
— Unverified 0Learning to Reason in Large Theories without Imitation May 25, 2019 Automated Theorem Proving Deep Reinforcement Learning
— Unverified 0Transferable Cost-Aware Security Policy Implementation for Malware Detection Using Deep Reinforcement Learning May 25, 2019 Deep Reinforcement Learning Malware Detection
— Unverified 0A Kernel Loss for Solving the Bellman Equation May 25, 2019 Q-Learning Reinforcement Learning
Code Code Available 0Exploration via Flow-Based Intrinsic Rewards May 24, 2019 Atari Games Optical Flow Estimation
Code Code Available 0A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer May 24, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Adaptive Symmetric Reward Noising for Reinforcement Learning May 24, 2019 Autonomous Driving Q-Learning
Code Code Available 0RL4health: Crowdsourcing Reinforcement Learning for Knee Replacement Pathway Optimization May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Automatic Machine Learning by Pipeline Synthesis using Model-Based Reinforcement Learning and a Grammar May 24, 2019 AutoML Bayesian Optimization
— Unverified 0MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning May 24, 2019 Decision Making Management
— Unverified 0InfoRL: Interpretable Reinforcement Learning using Information Maximization May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Continual Reinforcement Learning in 3D Non-stationary Environments May 24, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0A Micro-Objective Perspective of Reinforcement Learning May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0PAC Guarantees for Cooperative Multi-Agent Reinforcement Learning with Restricted Communication May 23, 2019 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Population-based Global Optimisation Methods for Learning Long-term Dependencies with RNNs May 23, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal May 23, 2019 Decision Making Deep Reinforcement Learning
Code Code Available 0Recurrent Value Functions May 23, 2019 continuous-control Continuous Control
— Unverified 0Hierarchical Reinforcement Learning for Concurrent Discovery of Compound and Composable Policies May 23, 2019 Hierarchical Reinforcement Learning reinforcement-learning
Code Code Available 0