Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for Parkinson Disease Treatment Feb 5, 2023 Reinforcement Learning (RL)
Code Code Available 0Model-free Quantum Gate Design and Calibration using Deep Reinforcement Learning Feb 5, 2023 Deep Reinforcement Learning Model free quantum gate design
Code Code Available 0An Online Model-Following Projection Mechanism Using Reinforcement Learning Feb 5, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Generalization of Deep Reinforcement Learning for Jammer-Resilient Frequency and Power Allocation Feb 4, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Traffic Light Control in Intelligent Transportation Systems Feb 4, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Online Reinforcement Learning in Non-Stationary Context-Driven Environments Feb 4, 2023 MuJoCo reinforcement-learning
Code Code Available 0Developing Driving Strategies Efficiently: A Skill-Based Hierarchical Reinforcement Learning Approach Feb 4, 2023 Autonomous Driving Hierarchical Reinforcement Learning
— Unverified 0Reinforcement Learning in Low-Rank MDPs with Density Features Feb 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning with History-Dependent Dynamic Contexts Feb 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Learning to Optimize for Reinforcement Learning Feb 3, 2023 Inductive Bias Meta-Learning
Code Code Available 1Deep Reinforcement Learning for Online Error Detection in Cyber-Physical Systems Feb 3, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties Feb 3, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Reinforcing User Retention in a Billion Scale Short Video Recommender System Feb 3, 2023 Recommendation Systems reinforcement-learning
— Unverified 0Mind the Gap: Offline Policy Optimization for Imperfect Rewards Feb 3, 2023 Reinforcement Learning (RL)
Code Code Available 1Two-Stage Constrained Actor-Critic for Short Video Recommendation Feb 3, 2023 Recommendation Systems reinforcement-learning
Code Code Available 1Distributional constrained reinforcement learning for supply chain optimization Feb 3, 2023 Distributional Reinforcement Learning Policy Gradient Methods
Code Code Available 0Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms Feb 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition Feb 2, 2023 Diversity Q-Learning
— Unverified 0ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints Feb 2, 2023 OpenAI Gym Reinforcement Learning (RL)
— Unverified 0Policy Expansion for Bridging Offline-to-Online Reinforcement Learning Feb 2, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs Feb 2, 2023 continuous-control Continuous Control
— Unverified 0MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks Feb 2, 2023 Reinforcement Learning (RL)
— Unverified 0Lower Bounds for Learning in Revealing POMDPs Feb 2, 2023 Reinforcement Learning (RL)
— Unverified 0Multi-zone HVAC Control with Model-Based Deep Reinforcement Learning Feb 1, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Selective Uncertainty Propagation in Offline RL Feb 1, 2023 Offline RL reinforcement-learning
— Unverified 0Internally Rewarded Reinforcement Learning Feb 1, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1Sample Complexity of Kernel-Based Q-Learning Feb 1, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement Learning Feb 1, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 2Combining Deep Reinforcement Learning and Search with Generative Models for Game-Theoretic Opponent Modeling Feb 1, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Collaborating with language models for embodied reasoning Feb 1, 2023 In-Context Learning Language Modeling
— Unverified 0Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO) Feb 1, 2023 MuJoCo reinforcement-learning
— Unverified 0QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing Feb 1, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Optimizing DDPM Sampling with Shortcut Fine-Tuning Jan 31, 2023 Denoising Reinforcement Learning (RL)
Code Code Available 1Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding Jan 31, 2023 Atari Games Evolutionary Algorithms
— Unverified 0A Reinforcement Learning Framework for Dynamic Mediation Analysis Jan 31, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Skill Decision Transformer Jan 31, 2023 D4RL Descriptive
Code Code Available 0Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning Jan 31, 2023 Edge-computing Management
— Unverified 0Scalable Grid-Aware Dynamic Matching using Deep Reinforcement Learning Jan 31, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees Jan 31, 2023 continuous-control Continuous Control
Code Code Available 1Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning Jan 31, 2023 Active Learning Computational Efficiency
Code Code Available 0Towards interpretable quantum machine learning via single-photon quantum walks Jan 31, 2023 Decision Making Quantum Machine Learning
— Unverified 0Retrosynthetic Planning with Dual Value Networks Jan 31, 2023 Drug Discovery Multi-step retrosynthesis
Code Code Available 1Scaling laws for single-agent reinforcement learning Jan 31, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning Jan 31, 2023 Decoder reinforcement-learning
Code Code Available 0Execution-based Code Generation using Deep Reinforcement Learning Jan 31, 2023 Code Completion Code Generation
Code Code Available 1Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments Jan 31, 2023 Reinforcement Learning (RL) Retrieval
Code Code Available 1Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks Jan 31, 2023 Blocking Graph Neural Network
— Unverified 0Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning Jan 30, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0STEEL: Singularity-aware Reinforcement Learning Jan 30, 2023 Off-policy evaluation reinforcement-learning
— Unverified 0PAC-Bayesian Soft Actor-Critic Learning Jan 30, 2023 Reinforcement Learning (RL)
Code Code Available 0