Policy Synthesis and Reinforcement Learning for Discounted LTL May 26, 2023 PAC learning reinforcement-learning
— Unverified 0A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents May 26, 2023 Instruction Following Reinforcement Learning (RL)
Code Code Available 0Distributional Reinforcement Learning with Dual Expectile-Quantile Regression May 26, 2023 Continuous Control Distributional Reinforcement Learning
— Unverified 0Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback May 26, 2023 Reinforcement Learning (RL)
— Unverified 0Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning May 25, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 3Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory May 25, 2023 Common Sense Reasoning CPU
Code Code Available 2Reward-Machine-Guided, Self-Paced Reinforcement Learning May 25, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models May 25, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes May 25, 2023 Bayesian Optimisation Inductive Bias
Code Code Available 0Deterministic policy gradient based optimal control with probabilistic constraints May 25, 2023 Model Predictive Control reinforcement-learning
— Unverified 0Market Making with Deep Reinforcement Learning from Limit Order Books May 25, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 1PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning May 25, 2023 Computational Efficiency reinforcement-learning
Code Code Available 1Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees May 24, 2023 Reinforcement Learning (RL)
Code Code Available 0Control invariant set enhanced safe reinforcement learning: improved sampling efficiency, guaranteed stability and robustness May 24, 2023 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure May 24, 2023 Matrix Completion reinforcement-learning
— Unverified 0Deep Reinforcement Learning with Plasticity Injection May 24, 2023 Computational Efficiency Deep Reinforcement Learning
— Unverified 0SPRING: Studying the Paper and Reasoning to Play Games May 24, 2023 Language Modelling Large Language Model
Code Code Available 1A Mini Review on the utilization of Reinforcement Learning with OPC UA May 24, 2023 Decision Making reinforcement-learning
— Unverified 0Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning May 24, 2023 Offline RL Reinforcement Learning (RL)
Code Code Available 1Conditional Mutual Information for Disentangled Representations in Reinforcement Learning May 23, 2023 continuous-control Continuous Control
Code Code Available 1When should we prefer Decision Transformers for Offline Reinforcement Learning? May 23, 2023 D4RL Imitation Learning
Code Code Available 1Constrained Proximal Policy Optimization May 23, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry May 23, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning May 23, 2023 Diversity reinforcement-learning
— Unverified 0Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML May 23, 2023 Bayesian Optimization Hyperparameter Optimization
— Unverified 0Lagrangian-based online safe reinforcement learning for state-constrained systems May 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice May 22, 2023 regression Reinforcement Learning (RL)
Code Code Available 0Policy Representation via Diffusion Probability Model for Reinforcement Learning May 22, 2023 continuous-control Continuous Control
Code Code Available 1INVICTUS: Optimizing Boolean Logic Circuit Synthesis via Synergistic Learning and Search May 22, 2023 Reinforcement Learning (RL)
— Unverified 0Offline Primal-Dual Reinforcement Learning for Linear MDPs May 22, 2023 Offline RL reinforcement-learning
— Unverified 0Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations May 22, 2023 Dynamic Time Warping reinforcement-learning
— Unverified 0FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation May 22, 2023 Imitation Learning Motion Planning
Code Code Available 2Towards Optimal Energy Management Strategy for Hybrid Electric Vehicle with Reinforcement Learning May 21, 2023 energy management Management
— Unverified 0BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer May 21, 2023 16k reinforcement-learning
Code Code Available 0SneakyPrompt: Jailbreaking Text-to-image Generative Models May 20, 2023 Reinforcement Learning (RL) Semantic Similarity
Code Code Available 1Model-based adaptation for sample efficient transfer in reinforcement learning control of parameter-varying systems May 20, 2023 Model Predictive Control reinforcement-learning
— Unverified 0Understanding the World to Solve Social Dilemmas Using Multi-Agent Reinforcement Learning May 19, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Learning Diverse Risk Preferences in Population-based Self-play May 19, 2023 Diversity reinforcement-learning
Code Code Available 1The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond May 18, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems May 18, 2023 Recommendation Systems reinforcement-learning
Code Code Available 1Client Selection for Federated Policy Optimization with Environment Heterogeneity May 18, 2023 MuJoCo Policy Gradient Methods
Code Code Available 0Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL May 18, 2023 Reinforcement Learning (RL)
— Unverified 0Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum May 17, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1A Genetic Fuzzy System for Interpretable and Parsimonious Reinforcement Learning Policies May 17, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning May 17, 2023 Offline RL reinforcement-learning
— Unverified 0Pittsburgh Learning Classifier Systems for Explainable Reinforcement Learning: Comparing with XCS May 17, 2023 Explainable Artificial Intelligence (XAI) reinforcement-learning
Code Code Available 0Revisiting the Minimalist Approach to Offline Reinforcement Learning May 16, 2023 D4RL Offline RL
Code Code Available 1Cooperation Is All You Need May 16, 2023 All Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions May 16, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Coagent Networks: Generalized and Scaled May 16, 2023 MuJoCo Reinforcement Learning (RL)
— Unverified 0