Policy Gradient for Reinforcement Learning with General Utilities Oct 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Square-root regret bounds for continuous-time episodic Markov decision processes Oct 3, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0MSRL: Distributed Reinforcement Learning with Dataflow Fragments Oct 3, 2022 CPU GPU
— Unverified 0Mastering Spatial Graph Prediction of Road Networks Oct 3, 2022 Prediction Reinforcement Learning (RL)
— Unverified 0Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation Oct 3, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation Oct 2, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Robust Bayesian optimization with reinforcement learned acquisition functions Oct 2, 2022 Bayesian Optimization reinforcement-learning
— Unverified 0Policy Gradients for Probabilistic Constrained Reinforcement Learning Oct 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model Oct 2, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0GFlowNets and variational inference Oct 2, 2022 Diversity Reinforcement Learning (RL)
Code Code Available 0Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach Oct 1, 2022 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0Comparing BERT-based Reward Functions for Deep Reinforcement Learning in Machine Translation Oct 1, 2022 Deep Reinforcement Learning Machine Translation
— Unverified 0Bayesian Q-learning With Imperfect Expert Demonstrations Oct 1, 2022 Atari Games Q-Learning
— Unverified 0Can Data Diversity Enhance Learning Generalization? Oct 1, 2022 Diversity Domain Adaptation
— Unverified 0Parsing Natural Language into Propositional and First-Order Logic with Dual Reinforcement Learning Oct 1, 2022 Natural Language Inference reinforcement-learning
— Unverified 0Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning Oct 1, 2022 Disentanglement Meta Reinforcement Learning
— Unverified 0RL-MD: A Novel Reinforcement Learning Approach for DNA Motif Discovery Sep 30, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Safe Exploration Method for Reinforcement Learning under Existence of Disturbance Sep 30, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Reward Shaping for User Satisfaction in a REINFORCE Recommender Sep 30, 2022 Imputation Reinforcement Learning (RL)
— Unverified 0Bounded Robustness in Reinforcement Learning via Lexicographic Objectives Sep 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Programmable Control of Ultrasound Swarmbots through Reinforcement Learning Sep 30, 2022 Diagnostic Navigate
— Unverified 0S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning Sep 30, 2022 Data Augmentation Image Generation
Code Code Available 0Towards a Fully Autonomous UAV Controller for Moving Platform Detection and Landing Sep 30, 2022 Reinforcement Learning (RL)
— Unverified 0The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Cyber-Physical Systems Sep 30, 2022 Reinforcement Learning (RL)
— Unverified 0Efficient LSTM Training with Eligibility Traces Sep 30, 2022 Q-Learning Reinforcement Learning (RL)
— Unverified 0B2RL: An open-source Dataset for Building Batch Reinforcement Learning Sep 30, 2022 Management reinforcement-learning
Code Code Available 0Efficiently Learning Small Policies for Locomotion and Manipulation Sep 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0ASPiRe:Adaptive Skill Priors for Reinforcement Learning Sep 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Improving Policy Learning via Language Dynamics Distillation Sep 30, 2022 NetHack Reinforcement Learning (RL)
Code Code Available 0A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning Sep 30, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training Sep 29, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Contrastive Unsupervised Learning of World Model with Invariant Causal Features Sep 29, 2022 Data Augmentation Depth Estimation
— Unverified 0How Does Return Distribution in Distributional Reinforcement Learning Help Optimization? Sep 29, 2022 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments Sep 29, 2022 Decision Making reinforcement-learning
— Unverified 0Learning Low-Frequency Motion Control for Robust and Dynamic Robot Locomotion Sep 29, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments Sep 29, 2022 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0Learning Parsimonious Dynamics for Generalization in Reinforcement Learning Sep 29, 2022 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making Sep 29, 2022 Decision Making Model-based Reinforcement Learning
— Unverified 0Reinforcement Learning Algorithms: An Overview and Classification Sep 29, 2022 Classification reinforcement-learning
— Unverified 0Scaling Laws for a Multi-Agent Reinforcement Learning Model Sep 29, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms Sep 29, 2022 Reinforcement Learning (RL)
— Unverified 0Online Weighted Q-Ensembles for Reduced Hyperparameter Tuning in Reinforcement Learning Sep 29, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Combining Reinforcement Learning and Tensor Networks, with an Application to Dynamical Large Deviations Sep 28, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Predictive Crypto-Asset Automated Market Making Architecture for Decentralized Finance using Deep Reinforcement Learning Sep 28, 2022 Deep Reinforcement Learning Q-Learning
— Unverified 0Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping Sep 28, 2022 Collision Avoidance Deep Reinforcement Learning
— Unverified 0Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees Sep 28, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Online Policy Optimization for Robust MDP Sep 28, 2022 Reinforcement Learning (RL)
— Unverified 0FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations Sep 28, 2022 Autonomous Driving Edge-computing
— Unverified 0Guiding Safe Exploration with Weakest Preconditions Sep 28, 2022 continuous-control Continuous Control
— Unverified 0Disentangling Transfer in Continual Reinforcement Learning Sep 28, 2022 Continual Learning continuous-control
— Unverified 0