Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges Jul 31, 2023 Reinforcement Learning (RL) Survey
— Unverified 0Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows Jul 29, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Using Implicit Behavior Cloning and Dynamic Movement Primitive to Facilitate Reinforcement Learning for Robot Motion Planning Jul 29, 2023 Motion Planning Reinforcement Learning (RL)
— Unverified 0PIMbot: Policy and Incentive Manipulation for Multi-Robot Reinforcement Learning in Social Dilemmas Jul 29, 2023 Reinforcement Learning (RL)
Code Code Available 0Shrink-Perturb Improves Architecture Mixing during Population Based Training for Neural Architecture Search Jul 28, 2023 Hyperparameter Optimization Image Generation
Code Code Available 0Primitive Skill-based Robot Learning from Human Evaluative Feedback Jul 28, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0TrackAgent: 6D Object Tracking via Reinforcement Learning Jul 28, 2023 Object Object Tracking
— Unverified 0Dialogue Shaping: Empowering Agents through NPC Interaction Jul 28, 2023 Knowledge Graphs reinforcement-learning
— Unverified 0ETHER: Aligning Emergent Communication for Hindsight Experience Replay Jul 28, 2023 Inductive Bias Instruction Following
— Unverified 0Approximate Model-Based Shielding for Safe Reinforcement Learning Jul 27, 2023 Atari Games model
Code Code Available 0Controlling the Latent Space of GANs through Reinforcement Learning: A Case Study on Task-based Image-to-Image Translation Jul 26, 2023 Image-to-Image Translation Reinforcement Learning (RL)
— Unverified 0Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks Jul 26, 2023 Decision Making LEMMA
— Unverified 0Reinforcement Learning by Guided Safe Exploration Jul 26, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Mode-constrained Model-based Reinforcement Learning via Gaussian Processes Jul 25, 2023 Gaussian Processes Model-based Reinforcement Learning
Code Code Available 0Unbiased Weight Maximization Jul 25, 2023 Reinforcement Learning (RL)
— Unverified 0Structural Credit Assignment with Coordinated Exploration Jul 25, 2023 Reinforcement Learning (RL)
— Unverified 0The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation Jul 25, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASH Jul 25, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Offline Reinforcement Learning with On-Policy Q-Function Regularization Jul 25, 2023 D4RL reinforcement-learning
— Unverified 0Settling the Sample Complexity of Online Reinforcement Learning Jul 25, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Counterfactual Explanation Policies in RL Jul 25, 2023 counterfactual Counterfactual Explanation
— Unverified 0Communication-Efficient Orchestrations for URLLC Service via Hierarchical Reinforcement Learning Jul 25, 2023 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0ExWarp: Extrapolation and Warping-based Temporal Supersampling for High-frequency Displays Jul 24, 2023 Reinforcement Learning (RL)
— Unverified 0Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning Jul 24, 2023 continuous-control Continuous Control
— Unverified 0On the Effectiveness of Offline RL for Dialogue Response Generation Jul 23, 2023 Offline RL reinforcement-learning
Code Code Available 0DIP-RL: Demonstration-Inferred Preference Learning in Minecraft Jul 22, 2023 Decision Making Minecraft
— Unverified 0Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations Jul 22, 2023 continuous-control Continuous Control
— Unverified 0Bridging the Reality Gap of Reinforcement Learning based Traffic Signal Control using Domain Randomization and Meta Learning Jul 21, 2023 Meta-Learning Reinforcement Learning (RL)
— Unverified 0Towards practical reinforcement learning for tokamak magnetic control Jul 21, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reparameterized Policy Learning for Multimodal Trajectory Optimization Jul 20, 2023 Reinforcement Learning (RL)
— Unverified 0A reinforcement learning approach for VQA validation: an application to diabetic macular edema grading Jul 19, 2023 Medical Image Analysis Question Answering
— Unverified 0Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks Jul 18, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees Jul 18, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Data Cross-Segmentation for Improved Generalization in Reinforcement Learning Based Algorithmic Trading Jul 18, 2023 Algorithmic Trading reinforcement-learning
— Unverified 0IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness Jul 18, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Towards A Unified Agent with Foundation Models Jul 18, 2023 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0REX: Rapid Exploration and eXploitation for AI Agents Jul 18, 2023 AI Agent Decision Making
— Unverified 0Quarl: A Learning-Based Quantum Circuit Optimizer Jul 17, 2023 Reinforcement Learning (RL)
— Unverified 0Basal-Bolus Advisor for Type 1 Diabetes (T1D) Patients Using Multi-Agent Reinforcement Learning (RL) Methodology Jul 17, 2023 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient Jul 17, 2023 Reinforcement Learning (RL)
— Unverified 0Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning Jul 16, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance Jul 16, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 0Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning Jul 16, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation Jul 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty Jul 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets Jul 15, 2023 Drug Discovery Reinforcement Learning (RL)
— Unverified 0Combining model-predictive control and predictive reinforcement learning for stable quadrupedal robot locomotion Jul 15, 2023 Model Predictive Control reinforcement-learning
— Unverified 0Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative Jul 13, 2023 Reinforcement Learning (RL)
— Unverified 0Transformers in Reinforcement Learning: A Survey Jul 12, 2023 Cloud Computing Combinatorial Optimization
— Unverified 0Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior Jul 12, 2023 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0