Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning Sep 25, 2019 continuous-control Continuous Control
— Unverified 0QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error Sep 25, 2019 continuous-control Continuous Control
— Unverified 0Samples Are Useful? Not Always: denoising policy gradient updates using variance explained Sep 25, 2019 continuous-control Continuous Control
— Unverified 0Learning Functionally Decomposed Hierarchies for Continuous Navigation Tasks Sep 25, 2019 continuous-control Continuous Control
— Unverified 0Policy Optimization In the Face of Uncertainty Sep 25, 2019 continuous-control Continuous Control
— Unverified 0Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning Sep 23, 2019 continuous-control Continuous Control
Code Code Available 0How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning? Sep 20, 2019 continuous-control Continuous Control
— Unverified 0Meta-Inverse Reinforcement Learning with Probabilistic Context Variables Sep 20, 2019 continuous-control Continuous Control
Code Code Available 0Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning Sep 17, 2019 continuous-control Continuous Control
— Unverified 0Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space Sep 15, 2019 continuous-control Continuous Control
— Unverified 0Biased Estimates of Advantages over Path Ensembles Sep 15, 2019 Atari Games continuous-control
— Unverified 0Driving in Dense Traffic with Model-Free Reinforcement Learning Sep 15, 2019 continuous-control Continuous Control
Code Code Available 0VILD: Variational Imitation Learning with Diverse-quality Demonstrations Sep 15, 2019 continuous-control Continuous Control
— Unverified 0Deterministic Value-Policy Gradients Sep 9, 2019 continuous-control Continuous Control
— Unverified 0Learning Action-Transferable Policy with Action Embedding Sep 5, 2019 Continuous Control Reinforcement Learning
Code Code Available 0Generalization in Transfer Learning Sep 3, 2019 continuous-control Continuous Control
— Unverified 0Dynamics-aware Embeddings Aug 25, 2019 continuous-control Continuous Control
Code Code Available 0Model-based Lookahead Reinforcement Learning Aug 15, 2019 continuous-control Continuous Control
— Unverified 0Continuous Control for High-Dimensional State Spaces: An Interactive Learning Approach Aug 14, 2019 continuous-control Continuous Control
— Unverified 0Inverse Rational Control with Partially Observable Continuous Nonlinear Dynamics Aug 13, 2019 continuous-control Continuous Control
— Unverified 0Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning Aug 6, 2019 continuous-control Continuous Control
— Unverified 0Neural Simplex Architecture Aug 1, 2019 continuous-control Continuous Control
— Unverified 0Learning Stabilizable Nonlinear Dynamics with Contraction-Based Regularization Jul 29, 2019 continuous-control Continuous Control
Code Code Available 0A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning Jul 11, 2019 continuous-control Continuous Control
— Unverified 0Imitation-Projected Programmatic Reinforcement Learning Jul 11, 2019 continuous-control Continuous Control
— Unverified 0On-Policy Robot Imitation Learning from a Converging Supervisor Jul 8, 2019 continuous-control Continuous Control
— Unverified 0On Inductive Biases in Deep Reinforcement Learning Jul 5, 2019 continuous-control Continuous Control
— Unverified 0Co-training for Policy Learning Jul 3, 2019 Combinatorial Optimization continuous-control
Code Code Available 0Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model Jul 1, 2019 continuous-control Continuous Control
Code Code Available 0FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control Jul 1, 2019 continuous-control Continuous Control
— Unverified 0Policy Optimization with Stochastic Mirror Descent Jun 25, 2019 Continuous Control Policy Gradient Methods
— Unverified 0Uncertainty-aware Model-based Policy Optimization Jun 25, 2019 continuous-control Continuous Control
— Unverified 0Learning Belief Representations for Imitation Learning in POMDPs Jun 22, 2019 continuous-control Continuous Control
Code Code Available 0Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction Jun 21, 2019 Autonomous Driving continuous-control
— Unverified 0Max-Plus Matching Pursuit for Deterministic Markov Decision Processes Jun 20, 2019 continuous-control Continuous Control
— Unverified 0Experience Replay Optimization Jun 19, 2019 continuous-control Continuous Control
— Unverified 0Reward Prediction Error as an Exploration Objective in Deep RL Jun 19, 2019 Atari Games Continuous Control
— Unverified 0Unsupervised Learning of Object Structure and Dynamics from Videos Jun 19, 2019 Action Recognition continuous-control
— Unverified 0Robust Reinforcement Learning for Continuous Control with Model Misspecification Jun 18, 2019 continuous-control Continuous Control
— Unverified 0Conditioning of Reinforcement Learning Agents and its Policy Regularization Application Jun 13, 2019 continuous-control Continuous Control
— Unverified 0Clustered Reinforcement Learning Jun 6, 2019 Atari Games Clustering
— Unverified 0Continuous Control for Automated Lane Change Behavior Based on Deep Deterministic Policy Gradient Algorithm Jun 5, 2019 continuous-control Continuous Control
— Unverified 0Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction Jun 3, 2019 continuous-control Continuous Control
Code Code Available 0Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator May 30, 2019 continuous-control Continuous Control
— Unverified 0Policy Search by Target Distribution Learning for Continuous Control May 27, 2019 continuous-control Continuous Control
— Unverified 0Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction May 27, 2019 continuous-control Continuous Control
— Unverified 0MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies May 23, 2019 continuous-control Continuous Control
Code Code Available 0Recurrent Value Functions May 23, 2019 continuous-control Continuous Control
— Unverified 0Combine PPO with NES to Improve Exploration May 23, 2019 continuous-control Continuous Control
— Unverified 0COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration May 22, 2019 continuous-control Continuous Control
Code Code Available 0