Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator May 30, 2019 continuous-control Continuous Control
— Unverified 0Effective Medical Test Suggestions Using Deep Reinforcement Learning May 30, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Defining Admissible Rewards for High Confidence Policy Evaluation May 30, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Towards Finding Longer Proofs May 30, 2019 Automated Theorem Proving reinforcement-learning
Code Code Available 0Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation May 30, 2019 Clustering Diversity
Code Code Available 0Variance Reduction for Evolution Strategies via Structured Control Variates May 29, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Advantage Amplification in Slowly Evolving Latent-State Environments May 29, 2019 Recommendation Systems reinforcement-learning
— Unverified 0Linear interpolation gives better gradients than Gaussian smoothing in derivative-free optimization May 29, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0An Improved Convergence Analysis of Stochastic Variance-Reduced Policy Gradient May 29, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning with Policy Mixture Model for Temporal Point Processes Clustering May 29, 2019 Clustering Point Processes
— Unverified 0Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology May 29, 2019 Q-Learning Recommendation Systems
— Unverified 0CopyCAT: Taking Control of Neural Policies with Constant Attacks May 29, 2019 Atari Games Deep Reinforcement Learning
— Unverified 0Switching Linear Dynamics for Variational Bayes Filtering May 29, 2019 Bayesian Inference Model-based Reinforcement Learning
— Unverified 0On the Generalization Gap in Reparameterizable Reinforcement Learning May 29, 2019 Learning Theory reinforcement-learning
— Unverified 0Learning robust control for LQR systems with multiplicative noise via policy gradient May 28, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Conditions on Features for Temporal Difference-Like Methods to Converge May 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning May 28, 2019 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Beyond Exponentially Discounted Sum: Automatic Learning of Return Function May 28, 2019 Atari Games Form
— Unverified 0A General Markov Decision Process Framework for Directly Learning Optimal Control Policies May 28, 2019 Q-Learning Reinforcement Learning
— Unverified 0Generation of Policy-Level Explanations for Reinforcement Learning May 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Interactive Teaching Algorithms for Inverse Reinforcement Learning May 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Snooping Attacks on Deep Reinforcement Learning May 28, 2019 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning May 27, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning May 27, 2019 Q-Learning reinforcement-learning
Code Code Available 0Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies May 27, 2019 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0Near-optimal Optimistic Reinforcement Learning using Empirical Bernstein Inequalities May 27, 2019 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Policy Search by Target Distribution Learning for Continuous Control May 27, 2019 continuous-control Continuous Control
— Unverified 0SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards May 27, 2019 Imitation Learning MuJoCo
Code Code Available 1Explainable Reinforcement Learning Through a Causal Lens May 27, 2019 counterfactual reinforcement-learning
Code Code Available 0Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning May 27, 2019 Decision Making Deep Reinforcement Learning
Code Code Available 0Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction May 27, 2019 continuous-control Continuous Control
— Unverified 0AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning May 27, 2019 Deep Reinforcement Learning Dialogue Management
— Unverified 0Interactive Differentiable Simulation May 26, 2019 Model Predictive Control parameter estimation
Code Code Available 2Selective Transfer with Reinforced Transfer Network for Partial Domain Adaptation May 26, 2019 Domain Adaptation Partial Domain Adaptation
— Unverified 0Variational Bayes: A report on approaches and applications May 26, 2019 Bayesian Inference Continual Learning
— Unverified 0Prioritized Sequence Experience Replay May 25, 2019 Deep Reinforcement Learning Q-Learning
— Unverified 0A Kernel Loss for Solving the Bellman Equation May 25, 2019 Q-Learning Reinforcement Learning
Code Code Available 0Transferable Cost-Aware Security Policy Implementation for Malware Detection Using Deep Reinforcement Learning May 25, 2019 Deep Reinforcement Learning Malware Detection
— Unverified 0Learning to Reason in Large Theories without Imitation May 25, 2019 Automated Theorem Proving Deep Reinforcement Learning
— Unverified 0Adversarial Policies: Attacking Deep Reinforcement Learning May 25, 2019 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Composing Task-Agnostic Policies with Deep Reinforcement Learning May 25, 2019 Decision Making Deep Reinforcement Learning
— Unverified 0Safe Reinforcement Learning with Nonlinear Dynamics via Model Predictive Shielding May 25, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0RL4health: Crowdsourcing Reinforcement Learning for Knee Replacement Pathway Optimization May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning May 24, 2019 Decision Making Management
— Unverified 0Exploration via Flow-Based Intrinsic Rewards May 24, 2019 Atari Games Optical Flow Estimation
Code Code Available 0InfoRL: Interpretable Reinforcement Learning using Information Maximization May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer May 24, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0A Micro-Objective Perspective of Reinforcement Learning May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound May 24, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Adaptive Symmetric Reward Noising for Reinforcement Learning May 24, 2019 Autonomous Driving Q-Learning
Code Code Available 0