MO2: Model-Based Offline Options Sep 5, 2022 continuous-control Continuous Control
— Unverified 0Normality-Guided Distributional Reinforcement Learning for Continuous Control Aug 28, 2022 continuous-control Continuous Control
— Unverified 0Improvement of Sliding Mode Control Strategy Founded on Cascaded Doubly Fed Induction Generator Powered by a Matrix Converter Aug 20, 2022 continuous-control Continuous Control
— Unverified 0Cooperative guidance of multiple missiles: a hybrid co-evolutionary approach Aug 15, 2022 continuous-control Continuous Control
— Unverified 0DDX7: Differentiable FM Synthesis of Musical Instrument Sounds Aug 12, 2022 continuous-control Continuous Control
— Unverified 0Sequence Model Imitation Learning with Unobserved Contexts Aug 3, 2022 continuous-control Continuous Control
Code Code Available 0Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach Aug 1, 2022 continuous-control Continuous Control
Code Code Available 0Meta Reinforcement Learning with Successor Feature Based Context Jul 29, 2022 continuous-control Continuous Control
— Unverified 0Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control Jul 27, 2022 continuous-control Continuous Control
— Unverified 0Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms Jul 27, 2022 continuous-control Continuous Control
Code Code Available 0Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy Jul 25, 2022 continuous-control Continuous Control
Code Code Available 0Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution Jul 22, 2022 Algorithmic Trading continuous-control
— Unverified 0Minimum Description Length Control Jul 17, 2022 Bayesian Inference continuous-control
— Unverified 0Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces Jul 12, 2022 continuous-control Continuous Control
Code Code Available 0Learning Bellman Complete Representations for Offline Policy Evaluation Jul 12, 2022 continuous-control Continuous Control
Code Code Available 0Compactly Restrictable Metric Policy Optimization Problems Jul 12, 2022 continuous-control Continuous Control
— Unverified 0Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning Jul 11, 2022 continuous-control Continuous Control
— Unverified 0Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization Jul 5, 2022 continuous-control Continuous Control
Code Code Available 0General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States Jul 4, 2022 continuous-control Continuous Control
Code Code Available 0Offline Policy Optimization with Eligible Actions Jul 1, 2022 continuous-control Continuous Control
Code Code Available 0Depth-CUPRL: Depth-Imaged Contrastive Unsupervised Prioritized Representations in Reinforcement Learning for Mapless Navigation of Unmanned Aerial Vehicles Jun 30, 2022 continuous-control Continuous Control
— Unverified 0Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization Jun 25, 2022 continuous-control Continuous Control
Code Code Available 0Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision Jun 23, 2022 continuous-control Continuous Control
— Unverified 0Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming Jun 22, 2022 Autonomous Driving Classification
— Unverified 0Generalised Policy Improvement with Geometric Policy Composition Jun 17, 2022 continuous-control Continuous Control
— Unverified 0Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning Jun 15, 2022 Autonomous Driving continuous-control
— Unverified 0Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning Jun 14, 2022 continuous-control Continuous Control
Code Code Available 0Defending Observation Attacks in Deep Reinforcement Learning via Detection and Denoising Jun 14, 2022 continuous-control Continuous Control
Code Code Available 0Relative Policy-Transition Optimization for Fast Policy Transfer Jun 13, 2022 continuous-control Continuous Control
— Unverified 0Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies Jun 12, 2022 continuous-control Continuous Control
— Unverified 0Model-based Offline Imitation Learning with Non-expert Data Jun 11, 2022 continuous-control Continuous Control
— Unverified 0Overcoming the Spectral Bias of Neural Value Approximation Jun 9, 2022 continuous-control Continuous Control
— Unverified 0Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance Jun 8, 2022 continuous-control Continuous Control
— Unverified 0ARC -- Actor Residual Critic for Adversarial Imitation Learning Jun 5, 2022 ARC continuous-control
— Unverified 0Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning Jun 2, 2022 continuous-control Continuous Control
— Unverified 0Minimax Optimal Online Imitation Learning via Replay Estimation May 30, 2022 continuous-control Continuous Control
Code Code Available 0TaSIL: Taylor Series Imitation Learning May 30, 2022 continuous-control Continuous Control
Code Code Available 0Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning May 29, 2022 Continuous Control Deep Reinforcement Learning
— Unverified 0Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning May 28, 2022 Continuous Control Model-based Reinforcement Learning
— Unverified 0SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning May 26, 2022 continuous-control Continuous Control
— Unverified 0Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning May 25, 2022 continuous-control Continuous Control
Code Code Available 0Efficient Reinforcement Learning from Demonstration Using Local Ensemble and Reparameterization with Split and Merge of Expert Policies May 23, 2022 continuous-control Continuous Control
— Unverified 0IL-flOw: Imitation Learning from Observation using Normalizing Flows May 19, 2022 continuous-control Continuous Control
— Unverified 0Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks May 18, 2022 continuous-control Continuous Control
Code Code Available 0A cGAN Ensemble-based Uncertainty-aware Surrogate Model for Offline Model-based Optimization in Industrial Control Problems May 15, 2022 continuous-control Continuous Control
— Unverified 0A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning May 11, 2022 continuous-control Continuous Control
Code Code Available 0Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods May 8, 2022 continuous-control Continuous Control
Code Code Available 0Skill-based Meta-Reinforcement Learning Apr 25, 2022 continuous-control Continuous Control
— Unverified 0Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach Apr 21, 2022 continuous-control Continuous Control
— Unverified 0SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics Apr 20, 2022 continuous-control Continuous Control
— Unverified 0