On the Reuse Bias in Off-Policy Reinforcement Learning | Sep 15, 2022 | continuous-control, Continuous Control | Code Available | 0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN Parameters | Sep 8, 2022 | Benchmarking, continuous-control | Code Available | 0
MO2: Model-Based Offline Options | Sep 5, 2022 | continuous-control, Continuous Control | Unverified | 0
Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-control, Continuous Control | Code Available | 1
Normality-Guided Distributional Reinforcement Learning for Continuous Control | Aug 28, 2022 | continuous-control, Continuous Control | Unverified | 0
Efficient Planning in a Compact Latent Action Space | Aug 22, 2022 | continuous-control, Continuous Control | Code Available | 1
Improvement of Sliding Mode Control Strategy Founded on Cascaded Doubly Fed Induction Generator Powered by a Matrix Converter | Aug 20, 2022 | continuous-control, Continuous Control | Unverified | 0
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm | Aug 16, 2022 | continuous-control, Continuous Control | Code Available | 1
Cooperative guidance of multiple missiles: a hybrid co-evolutionary approach | Aug 15, 2022 | continuous-control, Continuous Control | Unverified | 0
DDX7: Differentiable FM Synthesis of Musical Instrument Sounds | Aug 12, 2022 | continuous-control, Continuous Control | Unverified | 0
Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning | Aug 11, 2022 | continuous-control, Continuous Control | Code Available | 1
Sequence Model Imitation Learning with Unobserved Contexts | Aug 3, 2022 | continuous-control, Continuous Control | Code Available | 0
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach | Aug 1, 2022 | continuous-control, Continuous Control | Code Available | 0
Meta Reinforcement Learning with Successor Feature Based Context | Jul 29, 2022 | continuous-control, Continuous Control | Unverified | 0
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control | Jul 27, 2022 | continuous-control, Continuous Control | Unverified | 0
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms | Jul 27, 2022 | continuous-control, Continuous Control | Code Available | 0
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-control, Continuous Control | Code Available | 0
Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution | Jul 22, 2022 | Algorithmic Trading, continuous-control | Unverified | 0
Minimum Description Length Control | Jul 17, 2022 | Bayesian Inference, continuous-control | Unverified | 0
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces | Jul 12, 2022 | continuous-control, Continuous Control | Code Available | 0
Compactly Restrictable Metric Policy Optimization Problems | Jul 12, 2022 | continuous-control, Continuous Control | Unverified | 0
Learning Bellman Complete Representations for Offline Policy Evaluation | Jul 12, 2022 | continuous-control, Continuous Control | Code Available | 0
Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning | Jul 11, 2022 | continuous-control, Continuous Control | Unverified | 0
Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization | Jul 5, 2022 | continuous-control, Continuous Control | Code Available | 0
Goal-Conditioned Generators of Deep Policies | Jul 4, 2022 | continuous-control, Continuous Control | Code Available | 1
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States | Jul 4, 2022 | continuous-control, Continuous Control | Code Available | 0
Offline Policy Optimization with Eligible Actions | Jul 1, 2022 | continuous-control, Continuous Control | Code Available | 0
Depth-CUPRL: Depth-Imaged Contrastive Unsupervised Prioritized Representations in Reinforcement Learning for Mapless Navigation of Unmanned Aerial Vehicles | Jun 30, 2022 | continuous-control, Continuous Control | Unverified | 0
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse | Jun 28, 2022 | Continuous Control, Decision Making | Code Available | 1
Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization | Jun 25, 2022 | continuous-control, Continuous Control | Code Available | 0
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision | Jun 23, 2022 | continuous-control, Continuous Control | Unverified | 0
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | Jun 22, 2022 | Autonomous Driving, Classification | Unverified | 0
Generalised Policy Improvement with Geometric Policy Composition | Jun 17, 2022 | continuous-control, Continuous Control | Unverified | 0
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning | Jun 15, 2022 | Autonomous Driving, continuous-control | Unverified | 0
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Jun 14, 2022 | continuous-control, Continuous Control | Code Available | 0
Defending Observation Attacks in Deep Reinforcement Learning via Detection and Denoising | Jun 14, 2022 | continuous-control, Continuous Control | Code Available | 0
Transformers are Meta-Reinforcement Learners | Jun 14, 2022 | continuous-control, Continuous Control | Code Available | 1
Relative Policy-Transition Optimization for Fast Policy Transfer | Jun 13, 2022 | continuous-control, Continuous Control | Unverified | 0
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies | Jun 12, 2022 | continuous-control, Continuous Control | Unverified | 0
Model-based Offline Imitation Learning with Non-expert Data | Jun 11, 2022 | continuous-control, Continuous Control | Unverified | 0
Imitation Learning via Differentiable Physics | Jun 10, 2022 | continuous-control, Continuous Control | Code Available | 1
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk | Jun 9, 2022 | continuous-control, Continuous Control | Code Available | 1
Overcoming the Spectral Bias of Neural Value Approximation | Jun 9, 2022 | continuous-control, Continuous Control | Unverified | 0
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Jun 9, 2022 | Benchmarking, continuous-control | Code Available | 2
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance | Jun 8, 2022 | continuous-control, Continuous Control | Unverified | 0
ARC -- Actor Residual Critic for Adversarial Imitation Learning | Jun 5, 2022 | ARC, continuous-control | Unverified | 0
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning | Jun 2, 2022 | continuous-control, Continuous Control | Unverified | 0
Minimax Optimal Online Imitation Learning via Replay Estimation | May 30, 2022 | continuous-control, Continuous Control | Code Available | 0
TaSIL: Taylor Series Imitation Learning | May 30, 2022 | continuous-control, Continuous Control | Code Available | 0
RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch | May 30, 2022 | Continuous Control, Deep Reinforcement Learning | Code Available | 1